geoRent_predictionAI aims to help newcomers in Japan understand the factors influencing rental prices, enabling them to make informed decisions about their living arrangements. The project delves into the specific relationships between price and various features, such as:
- What factors contribute to low initial deposits?
- How does the distance to the nearest station and proximity to downtown affect rental prices differently, and by how much?
- etc.. name your relasion By balancing walkability and property location, you can find the best value for your budget. Or discover low-deposit options that meet your needs.
- Data Collection: Scraped data from 30,883 rental properties across 2,235 pages using
BeautifulSoup
. - Data Preprocessing:
- Removed irrelevant features and handled missing values using SQLite and Pandas.
- Created new variables, including building age and dummy variables
- handling categorical and numric values globally
- Visualization:
- Rental trends and correlations visualized using bar graphs, box plots, and heatmaps.
- Model Development:
- A feature selection model achieved an R² of 86% of accuracy.
- Web Deployment:
- User-friendly prediction app deployed via Gradio.
- Data scraped from Real Estate Japan.
- improved scapping performance by using muliti thered scapping, now whole website can be capped under 1 hour.
- Added dummy variables for categorical data and standardized numerical features.
- Correlation:
- Correlation between features and price.
- WEBUI:
- a gardio interface.
Metric | Value (JPY) |
---|---|
R² | 0.86 |
The project includes a deployed web interface that enables users to:
- Input property features like building age, square footage, and proximity to stations.
- Obtain correlations between features..
- First-time Renters: Understand market situation for planning purposes.
- Investors and Agents: Strategically evaluate rental properties.
- Selecting "best match" houses by combining the dataset with Google Maps API.
- Explore alternative machine learning models for added functionality.
- Expand the dataset to include Suumo data
- "fair price" checker
- coco2525remixed scapping code and readme.md