Software Developer Intern: developed an NLP-based text similarity model that compares resumes with job descriptions to speed up the hiring process and reduce bias in candidate selection.
- Resumes were scraped from Resdex's free resume search (personal data was redacted before use).
- Text was extracted from all resumes and job descriptions with textract, then preprocessed using the NLTK, spaCy, and Gensim libraries.
- For topic modelling, Gensim's TF-IDF and LDA implementations were applied.
- An alternative approach using Gensim's Word2Vec and cosine similarity was also explored, but it was not implemented because creating document-specific vectors for each resume took too long for the resources available.
- The rest of the project is presented as a dashboard built with Streamlit.io and Plotly's graphing library.
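
The core idea of scoring resumes against a job description can be illustrated with a minimal, dependency-free sketch. This is not the project's code: the project used Gensim's TF-IDF and LDA, whereas this example assumes plain-Python TF-IDF weights and cosine similarity as a stand-in. The names `tokenize`, `tfidf_vectors`, `cosine`, and `rank_resumes` are hypothetical, as is the smoothed IDF formula.

```python
import math
import re
from collections import Counter


def tokenize(text):
    # Lowercase and keep alphabetic runs only; a crude stand-in for
    # the NLTK/spaCy preprocessing mentioned above.
    return re.findall(r"[a-z]+", text.lower())


def tfidf_vectors(docs):
    # docs: list of token lists. Returns one {term: weight} dict per document.
    n_docs = len(docs)
    doc_freq = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        counts = Counter(doc)
        vectors.append({
            # Smoothed IDF (an assumption) so that terms present in every
            # document still carry a small positive weight.
            term: count * (math.log((1 + n_docs) / (1 + doc_freq[term])) + 1)
            for term, count in counts.items()
        })
    return vectors


def cosine(a, b):
    # Cosine similarity between two sparse vectors stored as dicts.
    dot = sum(weight * b[term] for term, weight in a.items() if term in b)
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_resumes(job_description, resumes):
    # Weight all documents together, then score each resume against the
    # job description. Returns (resume_index, score) pairs, best match first.
    docs = [tokenize(job_description)] + [tokenize(r) for r in resumes]
    job_vec, *resume_vecs = tfidf_vectors(docs)
    scores = [cosine(job_vec, vec) for vec in resume_vecs]
    return sorted(enumerate(scores), key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    ranking = rank_resumes(
        "python machine learning engineer",
        [
            "python developer with machine learning experience",
            "chef with pastry and baking experience",
        ],
    )
    for index, score in ranking:
        print(index, round(score, 3))
```

A resume that shares no terms with the job description scores 0.0 under this scheme, which is one reason topic models or word embeddings may be preferred when the vocabulary differs between documents.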
python, machine learning, data visualization, data analysis