This project analyzes consumer complaints from the Indian consumer complaint forum (https://www.consumercomplaints.in/). BeautifulSoup and urllib are used to crawl the site and parse its HTML pages to extract the complaint data. The data is stored in JSON format along with complaint properties such as title, created by, created date, and complaint details.
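
As a rough illustration of the JSON storage described above, the sketch below serializes one complaint record and reads it back. The field names (`title`, `created_by`, `created_date`, `details`) and the sample values are assumptions chosen for this example; the project's actual schema may differ.

```python
import json
from dataclasses import dataclass, asdict


@dataclass
class Complaint:
    # Hypothetical field names mirroring the properties listed above.
    title: str
    created_by: str
    created_date: str
    details: str


def to_json(complaints):
    """Serialize a list of Complaint objects to a JSON string."""
    return json.dumps([asdict(c) for c in complaints], ensure_ascii=False, indent=2)


def from_json(text):
    """Rebuild Complaint objects from a JSON string produced by to_json."""
    return [Complaint(**item) for item in json.loads(text)]


# Placeholder record, not real forum data.
sample = Complaint(
    title="Example title",
    created_by="example_user",
    created_date="2017-01-01",
    details="Placeholder complaint text.",
)

serialized = to_json([sample])
restored = from_json(serialized)
```

Keeping the record as a small dataclass makes it easy to add fields later without changing the serialization code.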
#Features
#Tools used
- urllib to fetch content from a URL.
- BeautifulSoup for parsing HTML content.
- TensorFlow and scikit-learn, under evaluation for classification and text summarization.
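
A minimal sketch of how urllib and BeautifulSoup can work together is shown below. The helper names (`fetch_html`, `parse_complaint`), the `User-Agent` header, and the CSS class names in the sample HTML are all hypothetical; the real forum markup is not documented here, so the selectors would need to be adapted to the actual pages.

```python
from urllib.request import Request, urlopen

from bs4 import BeautifulSoup


def fetch_html(url, timeout=10):
    """Download a page and return its body as text (not called in this sketch)."""
    request = Request(url, headers={"User-Agent": "Mozilla/5.0"})
    with urlopen(request, timeout=timeout) as response:
        return response.read().decode("utf-8", errors="replace")


def parse_complaint(html):
    """Extract complaint fields from HTML using assumed class names."""
    soup = BeautifulSoup(html, "html.parser")

    def text_of(tag, css_class):
        # Return the stripped text of the first matching element, or None.
        node = soup.find(tag, class_=css_class)
        return node.get_text(" ", strip=True) if node else None

    return {
        "title": text_of("h1", "complaint-title"),
        "created_by": text_of("span", "complaint-author"),
        "created_date": text_of("span", "complaint-date"),
        "details": text_of("div", "complaint-body"),
    }


# Inline placeholder markup so the parser can be tried without network access.
SAMPLE_HTML = """
<html><body>
  <h1 class="complaint-title">Example title</h1>
  <span class="complaint-author">example_user</span>
  <span class="complaint-date">2017-01-01</span>
  <div class="complaint-body">Placeholder complaint text.</div>
</body></html>
"""

record = parse_complaint(SAMPLE_HTML)
```

Separating the download step from the parsing step lets the parser be exercised against saved pages, which is useful while the selectors are still being worked out.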
#TODO
- Classify complaints into categories such as banking and e-commerce.
- Summarize consumer complaint details.
- Predict the resolution rate for a new complaint posted on the forum.
