Quora_QuestionPair

- Applied Natural Language Processing preprocessing (removal of stopwords, punctuation, and hyperlinks, followed by tokenization and stemming) to prepare the dataset of 404,290 rows.
- Extracted basic and advanced features, including fuzzy-matching ("fuzz") features, and explored feature importance.
- Transformed the text into numerical vectors with a TF-IDF vectorizer, fitted Logistic Regression and XGBoost models, and tuned the XGBoost hyperparameters to reach an AUC of 0.91 and an accuracy of 83%.
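The preprocessing and feature-extraction steps described above can be sketched roughly as follows. This is a minimal illustration using only the Python standard library, not the notebook's actual code: the stopword list is a tiny hypothetical sample, stemming is omitted, the function names `preprocess` and `pair_features` are made up for this example, and `difflib.SequenceMatcher` stands in for the fuzz features mentioned in the description.

```python
import re
import string
from difflib import SequenceMatcher

# Tiny illustrative stopword list (an assumption; the project's real list is not shown).
STOPWORDS = {"a", "an", "the", "is", "are", "do", "does", "i",
             "to", "of", "in", "what", "how", "can"}

URL_RE = re.compile(r"https?://\S+|www\.\S+")
PUNCT_TABLE = str.maketrans("", "", string.punctuation)


def preprocess(text):
    """Lowercase, drop hyperlinks and punctuation, split on whitespace, remove stopwords."""
    text = URL_RE.sub(" ", text.lower())
    text = text.translate(PUNCT_TABLE)
    return [tok for tok in text.split() if tok not in STOPWORDS]


def pair_features(q1, q2):
    """Compute a few basic features and one fuzzy-similarity feature for a question pair."""
    t1, t2 = preprocess(q1), preprocess(q2)
    s1, s2 = set(t1), set(t2)
    common = len(s1 & s2)
    total = len(s1) + len(s2)
    return {
        "q1_len": len(t1),                      # token count of question 1
        "q2_len": len(t2),                      # token count of question 2
        "common_words": common,                 # distinct tokens shared by both
        "word_share": common / total if total else 0.0,
        # Stand-in for a fuzz-style ratio, computed on the cleaned strings.
        "similarity_ratio": SequenceMatcher(None, " ".join(t1), " ".join(t2)).ratio(),
    }


features = pair_features("What is the best way to learn Python?",
                         "How do I learn Python?")
print(features)
```

Features like these would typically be combined with the TF-IDF vectors before fitting a classifier; that modeling step is left out here because it depends on third-party libraries and settings the description does not specify.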

sonalgaud12/Quora_QuestionPair

Folders and files

Latest commit

History

Repository files navigation

Quora_QuestionPair

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages