California housing price regression

Predicting the prices of housing in different block groups accross California with regression modelling based on the California Housing Dataset (1990). The intended method of regression is linear regression for the sake of statstical modelling under the the restrictions of the Ordinary least squares algorithm. Several other regression algorithms such as decision tree regression were used for comparison, and finally a best performing model was obtained without the restrictions of OLS. The dataset contains sociodemographic, real estate and geographical data. Models were assessed in terms of error metrics and mean differences from actual values.

Dataset source

This dataset is a modified version of the California Housing dataset available from Luís Torgo's page (University of Porto). Luís Torgo obtained it from the StatLib repository (which is closed now). The dataset may also be downloaded from StatLib mirrors.

This dataset appeared in a 1997 paper titled Sparse Spatial Autoregressions by Pace, R. Kelley and Ronald Barry, published in the Statistics and Probability Letters journal. They built it using the 1990 California census data. It contains one row per census block group. A block group is the smallest geographical unit for which the U.S. Census Bureau publishes sample data (a block group typically has a population of 600 to 3,000 people).

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
data		data
images		images
.gitattributes		.gitattributes
Data modelling variation.ipynb		Data modelling variation.ipynb
Data modelling.ipynb		Data modelling.ipynb
Data preparation.ipynb		Data preparation.ipynb
Data understanding I.ipynb		Data understanding I.ipynb
Data understanding II.ipynb		Data understanding II.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

California housing price regression

Dataset source

Contents

About

Uh oh!

Releases

Packages

Languages

jerrold110/Regression-housing-prices

Folders and files

Latest commit

History

Repository files navigation

California housing price regression

Dataset source

Contents

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages