A complete NLP pipeline that predicts the primary programming language used in GitHub repositories. The project includes automated web scraping, text extraction from READMEs and source files, feature engineering using TF-IDF, and a classification model to determine the technology stack.
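The TF-IDF and classification stages described above might look roughly like the following minimal sketch. It is not the project's actual code. To stay dependency-free it uses only the Python standard library, and it swaps in a nearest-centroid classifier because the description does not name the model. The function names (`tokenize`, `fit_idf`, `tfidf_vector`, `fit_centroids`, `predict`) and the toy training texts are all hypothetical, standing in for the text that the scraping and extraction steps would produce.

```python
import math
import re
from collections import Counter, defaultdict

# Assumption: a simple tokenizer that keeps letters plus a few symbols
# common in language names (e.g. "c++", "c#").
TOKEN_RE = re.compile(r"[A-Za-z_+#]+")


def tokenize(text):
    return [t.lower() for t in TOKEN_RE.findall(text)]


def fit_idf(docs):
    """Smoothed inverse document frequency for every token in the corpus."""
    n = len(docs)
    df = Counter()
    for doc in docs:
        df.update(set(tokenize(doc)))
    return {tok: math.log((1 + n) / (1 + count)) + 1.0 for tok, count in df.items()}


def tfidf_vector(text, idf):
    """L2-normalised TF-IDF vector as a sparse dict; unseen tokens are dropped."""
    tf = Counter(t for t in tokenize(text) if t in idf)
    vec = {tok: count * idf[tok] for tok, count in tf.items()}
    norm = math.sqrt(sum(v * v for v in vec.values()))
    return {tok: v / norm for tok, v in vec.items()} if norm else {}


def fit_centroids(docs, labels, idf):
    """Average the TF-IDF vectors of each class into one centroid per label."""
    sums = defaultdict(lambda: defaultdict(float))
    counts = Counter(labels)
    for doc, label in zip(docs, labels):
        for tok, value in tfidf_vector(doc, idf).items():
            sums[label][tok] += value
    return {
        label: {tok: value / counts[label] for tok, value in vec.items()}
        for label, vec in sums.items()
    }


def predict(text, idf, centroids):
    """Return the label whose centroid has the largest dot product with the text."""
    vec = tfidf_vector(text, idf)

    def score(label):
        centroid = centroids[label]
        return sum(value * centroid.get(tok, 0.0) for tok, value in vec.items())

    return max(sorted(centroids), key=score)


# Hypothetical stand-ins for text extracted from READMEs and source files.
train_docs = [
    "pip install requests; import numpy as np; def main(): pass",
    "python script using pandas and def train(self): import os",
    "npm install express; const app = require('express'); function handler() {}",
    "node project: const fs = require('fs'); npm run build; function main() {}",
]
train_labels = ["Python", "Python", "JavaScript", "JavaScript"]

idf = fit_idf(train_docs)
centroids = fit_centroids(train_docs, train_labels, idf)
prediction = predict("import pandas; def load(): pass", idf, centroids)
print(prediction)
```

On real data, a library implementation such as scikit-learn's `TfidfVectorizer` paired with a linear classifier would be the more usual choice. The hand-rolled version here is only meant to make the two stages visible.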