[Home ] [Archive]    
Main Menu
Journal Information::
Home::
Archive::
For Authors::
For Reviewers::
Principles of Transparency::
Contact us::
::
Search in website

Advanced Search
..
Committed to

AWT IMAGE

Attribution-NonCommercial
CC BY-NC


AWT IMAGE

Open Access Publishing


AWT IMAGE

Prevent Plagiarism

..
Registered in


..
Statistics
Journal volumes: 17
Journal issues: 34
Articles views: 683905
Articles downloads: 342215

Total authors: 581
Unique authors: 422
Repeated authors: 159
Repeated authors percent: 27

Submitted articles: 368
Accepted articles: 266
Rejected articles: 25
Published articles: 219

Acceptance rate: 72.28
Rejection rate: 6.79

Average Time to Accept: 282 days
Average Time to First Review: 27.2 days
Average Time to Publish: 26.1 days

Last 3 years statistics:
Submitted articles: 54
Accepted articles: 37
Rejected articles: 6
Published articles: 17

Acceptance rate: 68.52
Rejection rate: 11.11

Average Time to Accept: 205 days
Average Time to First Review: 6.7 days
Average Time to Publish: 118 days
____
..
:: Volume 17, Issue 1 (8-2020) ::
JSRI 2020, 17(1): 191-214 Back to browse issues page
Machine Learning Models for Housing Prices Forecasting using Registration Data
Mehdi Farahzadi1 , Rahman Farnoosh 2, Mohammad Hassan Behzadi1
1- Islamic Azad University
2- Iran University of Science and Technoligy , rfarnoosh@iust.ac.ir
Abstract:   (745 Views)
This article has been compiled to identify the best model of housing price forecasting using machine learning methods with maximum accuracy and minimum error. Five important machine learning algorithms are used to predict housing prices, including Nearest Neighbor Regression Algorithm (KNNR), Support Vector Regression Algorithm (SVR), Random Forest Regression Algorithm (RFR), Extreme Gradient Boosting Regression Algorithm (XGBR), and the Long Short-Term Memory Neural Network Algorithm (LSTM). This research has been done using the data of the Statistics Center of Iran, which contains information on the purchase and sale of residential units in Tehran in the years 2014 to 2020 and includes 998299 transactions and 11 features. Loss of data, batch data conversion, normalization, etc. are performed on the housing data set to obtain the final and error-free data set. To divide the data set into training and test data sets, the important and practical method of cross-validation or K-Fold has been used because of its simplicity and effectiveness and as a universally valid method. Various evaluation criteria such as MSE, RMSE, MAE,ME and R2 were used to compare the models and identify the best model. Comparison of models in terms of all evaluation criteria in all K-fold subsets proves the stability and superiority of the Extreme Gradient Boosting Regression model.
 
Keywords: Housing price forecasting, nearest neighbor regression, random forest regression, support vector regression, long short-Term memory neural network, and extreme gradient boosting regression.​​​​​
Full-Text [PDF 1005 kb]   (723 Downloads)    
Type of Study: Applicable | Subject: General
Received: 2022/05/25 | Accepted: 2022/02/22 | Published: 2020/08/22
References
1. Abdel-Nasser, M., and Mahmoud, K. (2019). Accurate photovoltaic power forecasting models using deep LSTM-RNN. Neural Computing and Applications, 31, 2727-2740. [DOI:10.1007/s00521-017-3225-z]
2. Abidoye, R.B., and Chan, A.P. (2017). Modelling property values in Nigeria using artificial neural network. Journal of Property Research, 34, 36-53. [DOI:10.1080/09599916.2017.1286366]
3. Abidoye, R.B., and Chan, A.P. (2018). Improving property valuation accuracy: A comparison of hedonic pricing model and artificial neural network. Pacific Rim Property Research Journal, 24, 71-83. [DOI:10.1080/14445921.2018.1436306]
4. Ahmad, M.W., Reynolds, J., and Rezgui, Y. (2018). Predictive modelling for solar thermal energy systems: A comparison of support vector regression, random forest, extra trees and regression trees. Journal of cleaner production, 203, 810-821. [DOI:10.1016/j.jclepro.2018.08.207]
5. Ahmad, M.S., Adnan, S.M., Zaidi, S., and Bhargava, P. (2020). A novel support vector regression (SVR) model for the prediction of splice strength of the unconfined beam specimens. Construction and building materials, 248, 118475. [DOI:10.1016/j.conbuildmat.2020.118475]
6. Bai, Y., Xie, J., Liu, C., Tao, Y., Zeng, B., and Li, C. (2021). Regression modeling for enterprise electricity consumption: A comparison of recurrent neural network and its variants. International Journal of Electrical Power & Energy Systems, 126, 106612. [DOI:10.1016/j.ijepes.2020.106612]
7. Breiman, L. (2001). Random forests. Machine learning, 45, 5-32. [DOI:10.1023/A:1010933404324]
8. Chen, T., and Guestrin, C. (2016). Xgboost: A scalable tree boosting system. In Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining, (pp. 785-794). [DOI:10.1145/2939672.2939785]
9. El Hamzaoui, Y., and Perez, J.A.H. (2011, November). Application of artificial neural networks to predict the selling price in the real estate valuation process. In 2011 10th Mexican International Conference on Artificial Intelligence (pp. 175-181). IEEE. [DOI:10.1109/MICAI.2011.14]
10. Hamed, Y., Shafie, A.F., Mustaffa, Z., and Rusma, N. (2019). Error-reduction approach for corrosion measurements of pipeline inline inspection tools. Measurement and Control, 52, 28-36. [DOI:10.1177/0020294018813643]
11. Hamed, Y., Alzahrani, A.I., Mustaffa, Z., Ismail, M.C., and Eng, K.K. (2020). Two steps hybrid calibration algorithm of support vector regression and K-nearest neighbors. Alexandria Engineering Journal, 59, 1181-1190. [DOI:10.1016/j.aej.2020.01.033]
12. Ho, W.K., Tang, B.S., and Wong, S.W. (2021). Predicting property prices with machine learning algorithms. Journal of Property Research, 38, 48-70. [DOI:10.1080/09599916.2020.1832558]
13. Huang, G., Wu, L., Ma, X., Zhang, W., Fan, J., Yu, X., and Zhou, H. (2019). Evaluation of CatBoost method for prediction of reference evapotranspiration in humid regions. Journal of Hydrology, 574, 1029-1041. [DOI:10.1016/j.jhydrol.2019.04.085]
14. Kang, J., Lee, H.J., Jeong, S.H., Lee, H.S., and Oh, K.J. (2020). Developing a forecasting model for real estate auction prices using artificial intelligence. Sustainability, 12, 2899. [DOI:10.3390/su12072899]
15. Kim, M.K., Kim, Y.S., and Srebric, J. (2020). Predictions of electricity consumption in a campus building using occupant rates and weather elements with sensitivity analysis: Artificial neural network vs. linear regression. Sustainable Cities and Society, 62, 102385. [DOI:10.1016/j.scs.2020.102385]
16. Li, Y., Zou, C., Berecibar, M., Nanini-Maury, E., Chan, J.C.W., Van den Bossche, P., and Omar, N. (2018). Random forest regression for online capacity estimation of lithium-ion batteries. Applied energy, 232, 197-210. [DOI:10.1016/j.apenergy.2018.09.182]
17. Lu, S., Li, Z., Qin, Z., Yang, X., and Goh, R. S. M. (2017). A hybrid regression technique for house prices prediction. In 2017 IEEE international conference on industrial engineering and engineering management (IEEM) (pp. 319-323). IEEE. [DOI:10.1109/IEEM.2017.8289904]
18. Ma, B., Meng, F., Yan, G., Yan, H., Chai, B., and Song, F. (2020). Diagnostic classification of cancers using extreme gradient boosting algorithm and multi-omics data. Computers in biology and medicine, 121, 103761. [DOI:10.1016/j.compbiomed.2020.103761]
19. Madhuri, C.R., Anuradha, G., and Pujitha, M.V. (2019). House price prediction using regression techniques: a comparative study. In 2019 International conference on smart structures and systems (ICSSS) (pp. 1-5). IEEE. [DOI:10.1109/ICSSS.2019.8882834]
20. Manasa, J., Gupta, R., and Narahari, N.S. (2020, March). Machine learning based predicting house prices using regression techniques. In 2020 2nd International conference on innovative mechanisms for industry applications (ICIMIA) (pp. 624-630). IEEE. [DOI:10.1109/ICIMIA48430.2020.9074952]
21. Morano, P., Tajani, F., and Carmelo, M.T. (2015). Artificial Intelligence in Property Valuation. Advances in Environmental Science and Energy Planning.
22. Peter, N.J., Okagbue, H.I., Obasi, E.C., and Akinola, A.O. (2020). Review on the application of artificial neural networks in real estate valuation. International Journal of Advanced Trends in Computer Science and Engineering, 9, 2918 - 2925. [DOI:10.30534/ijatcse/2020/66932020]
23. Punia, S., Nikolopoulos, K., Singh, S.P., Madaan, J.K., and Litsiou, K. (2020). Deep learning with long short-term memory networks and random forests for demand forecasting in multi-channel retail. International journal of production research, 58, 4964-4979. [DOI:10.1080/00207543.2020.1735666]
24. Rafiei, M.H., and Adeli, H. (2016). A novel machine learning model for estimation of sale prices of real estate units. Journal of Construction Engineering and Management, 142, 04015066. [DOI:10.1061/(ASCE)CO.1943-7862.0001047]
25. Rodriguez-Galiano, V., Sanchez-Castillo, M., Chica-Olmo, M., and Chica-Rivas, M.J.O.G.R. (2015). Machine learning predictive models for mineral prospectivity: An evaluation of neural networks, random forest, regression trees and support vector machines. Ore Geology Reviews, 71, 804-818. [DOI:10.1016/j.oregeorev.2015.01.001]
26. Varma, A., Sarma, A., Doshi, S., and Nair, R. (2018). House price prediction using machine learning and neural networks. In 2018 second international conference on inventive communication and computational technologies (ICICCT) (pp. 1936-1939). IEEE. [DOI:10.1109/ICICCT.2018.8473231]
27. Wang, W.C., Chang, Y.J., and Wang, H.C. (2019). An application of the spatial autocorrelation method on the change of real estate prices in Taitung City. ISPRS International Journal of Geo-Information, 8(6),249. [DOI:10.3390/ijgi8060249]
Send email to the article author

Add your comments about this article
Your username or Email:

CAPTCHA


XML     Print


Download citation:
BibTeX | RIS | EndNote | Medlars | ProCite | Reference Manager | RefWorks
Send citation to:

Farahzadi M, Farnoosh R, Behzadi M H. Machine Learning Models for Housing Prices Forecasting using Registration Data. JSRI 2020; 17 (1) :191-214
URL: http://jsri.srtc.ac.ir/article-1-420-en.html


Rights and permissions
Creative Commons License This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
Volume 17, Issue 1 (8-2020) Back to browse issues page
مجله‌ی پژوهش‌های آماری ایران Journal of Statistical Research of Iran JSRI
Persian site map - English site map - Created in 0.05 seconds with 42 queries by YEKTAWEB 4645