1- Islamic Azad University 2- Iran University of Science and Technoligy , rfarnoosh@iust.ac.ir
Abstract: (745 Views)
This article has been compiled to identify the best model of housing price forecasting using machine learning methods with maximum accuracy and minimum error. Five important machine learning algorithms are used to predict housing prices, including Nearest Neighbor Regression Algorithm (KNNR), Support Vector Regression Algorithm (SVR), Random Forest Regression Algorithm (RFR), Extreme Gradient Boosting Regression Algorithm (XGBR), and the Long Short-Term Memory Neural Network Algorithm (LSTM). This research has been done using the data of the Statistics Center of Iran, which contains information on the purchase and sale of residential units in Tehran in the years 2014 to 2020 and includes 998299 transactions and 11 features. Loss of data, batch data conversion, normalization, etc. are performed on the housing data set to obtain the final and error-free data set. To divide the data set into training and test data sets, the important and practical method of cross-validation or K-Fold has been used because of its simplicity and effectiveness and as a universally valid method. Various evaluation criteria such as MSE, RMSE, MAE,ME and R2 were used to compare the models and identify the best model. Comparison of models in terms of all evaluation criteria in all K-fold subsets proves the stability and superiority of the Extreme Gradient Boosting Regression model.
Type of Study: Applicable |
Subject: General Received: 2022/05/25 | Accepted: 2022/02/22 | Published: 2020/08/22
References
1. Abdel-Nasser, M., and Mahmoud, K. (2019). Accurate photovoltaic power forecasting models using deep LSTM-RNN. Neural Computing and Applications, 31, 2727-2740. [DOI:10.1007/s00521-017-3225-z]
2. Abidoye, R.B., and Chan, A.P. (2017). Modelling property values in Nigeria using artificial neural network. Journal of Property Research, 34, 36-53. [DOI:10.1080/09599916.2017.1286366]
3. Abidoye, R.B., and Chan, A.P. (2018). Improving property valuation accuracy: A comparison of hedonic pricing model and artificial neural network. Pacific Rim Property Research Journal, 24, 71-83. [DOI:10.1080/14445921.2018.1436306]
4. Ahmad, M.W., Reynolds, J., and Rezgui, Y. (2018). Predictive modelling for solar thermal energy systems: A comparison of support vector regression, random forest, extra trees and regression trees. Journal of cleaner production, 203, 810-821. [DOI:10.1016/j.jclepro.2018.08.207]
5. Ahmad, M.S., Adnan, S.M., Zaidi, S., and Bhargava, P. (2020). A novel support vector regression (SVR) model for the prediction of splice strength of the unconfined beam specimens. Construction and building materials, 248, 118475. [DOI:10.1016/j.conbuildmat.2020.118475]
6. Bai, Y., Xie, J., Liu, C., Tao, Y., Zeng, B., and Li, C. (2021). Regression modeling for enterprise electricity consumption: A comparison of recurrent neural network and its variants. International Journal of Electrical Power & Energy Systems, 126, 106612. [DOI:10.1016/j.ijepes.2020.106612]
8. Chen, T., and Guestrin, C. (2016). Xgboost: A scalable tree boosting system. In Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining, (pp. 785-794). [DOI:10.1145/2939672.2939785]
9. El Hamzaoui, Y., and Perez, J.A.H. (2011, November). Application of artificial neural networks to predict the selling price in the real estate valuation process. In 2011 10th Mexican International Conference on Artificial Intelligence (pp. 175-181). IEEE. [DOI:10.1109/MICAI.2011.14]
10. Hamed, Y., Shafie, A.F., Mustaffa, Z., and Rusma, N. (2019). Error-reduction approach for corrosion measurements of pipeline inline inspection tools. Measurement and Control, 52, 28-36. [DOI:10.1177/0020294018813643]
11. Hamed, Y., Alzahrani, A.I., Mustaffa, Z., Ismail, M.C., and Eng, K.K. (2020). Two steps hybrid calibration algorithm of support vector regression and K-nearest neighbors. Alexandria Engineering Journal, 59, 1181-1190. [DOI:10.1016/j.aej.2020.01.033]
12. Ho, W.K., Tang, B.S., and Wong, S.W. (2021). Predicting property prices with machine learning algorithms. Journal of Property Research, 38, 48-70. [DOI:10.1080/09599916.2020.1832558]
13. Huang, G., Wu, L., Ma, X., Zhang, W., Fan, J., Yu, X., and Zhou, H. (2019). Evaluation of CatBoost method for prediction of reference evapotranspiration in humid regions. Journal of Hydrology, 574, 1029-1041. [DOI:10.1016/j.jhydrol.2019.04.085]
14. Kang, J., Lee, H.J., Jeong, S.H., Lee, H.S., and Oh, K.J. (2020). Developing a forecasting model for real estate auction prices using artificial intelligence. Sustainability, 12, 2899. [DOI:10.3390/su12072899]
15. Kim, M.K., Kim, Y.S., and Srebric, J. (2020). Predictions of electricity consumption in a campus building using occupant rates and weather elements with sensitivity analysis: Artificial neural network vs. linear regression. Sustainable Cities and Society, 62, 102385. [DOI:10.1016/j.scs.2020.102385]
16. Li, Y., Zou, C., Berecibar, M., Nanini-Maury, E., Chan, J.C.W., Van den Bossche, P., and Omar, N. (2018). Random forest regression for online capacity estimation of lithium-ion batteries. Applied energy, 232, 197-210. [DOI:10.1016/j.apenergy.2018.09.182]
17. Lu, S., Li, Z., Qin, Z., Yang, X., and Goh, R. S. M. (2017). A hybrid regression technique for house prices prediction. In 2017 IEEE international conference on industrial engineering and engineering management (IEEM) (pp. 319-323). IEEE. [DOI:10.1109/IEEM.2017.8289904]
18. Ma, B., Meng, F., Yan, G., Yan, H., Chai, B., and Song, F. (2020). Diagnostic classification of cancers using extreme gradient boosting algorithm and multi-omics data. Computers in biology and medicine, 121, 103761. [DOI:10.1016/j.compbiomed.2020.103761]
19. Madhuri, C.R., Anuradha, G., and Pujitha, M.V. (2019). House price prediction using regression techniques: a comparative study. In 2019 International conference on smart structures and systems (ICSSS) (pp. 1-5). IEEE. [DOI:10.1109/ICSSS.2019.8882834]
20. Manasa, J., Gupta, R., and Narahari, N.S. (2020, March). Machine learning based predicting house prices using regression techniques. In 2020 2nd International conference on innovative mechanisms for industry applications (ICIMIA) (pp. 624-630). IEEE. [DOI:10.1109/ICIMIA48430.2020.9074952]
21. Morano, P., Tajani, F., and Carmelo, M.T. (2015). Artificial Intelligence in Property Valuation. Advances in Environmental Science and Energy Planning.
22. Peter, N.J., Okagbue, H.I., Obasi, E.C., and Akinola, A.O. (2020). Review on the application of artificial neural networks in real estate valuation. International Journal of Advanced Trends in Computer Science and Engineering, 9, 2918 - 2925. [DOI:10.30534/ijatcse/2020/66932020]
23. Punia, S., Nikolopoulos, K., Singh, S.P., Madaan, J.K., and Litsiou, K. (2020). Deep learning with long short-term memory networks and random forests for demand forecasting in multi-channel retail. International journal of production research, 58, 4964-4979. [DOI:10.1080/00207543.2020.1735666]
24. Rafiei, M.H., and Adeli, H. (2016). A novel machine learning model for estimation of sale prices of real estate units. Journal of Construction Engineering and Management, 142, 04015066. [DOI:10.1061/(ASCE)CO.1943-7862.0001047]
25. Rodriguez-Galiano, V., Sanchez-Castillo, M., Chica-Olmo, M., and Chica-Rivas, M.J.O.G.R. (2015). Machine learning predictive models for mineral prospectivity: An evaluation of neural networks, random forest, regression trees and support vector machines. Ore Geology Reviews, 71, 804-818. [DOI:10.1016/j.oregeorev.2015.01.001]
26. Varma, A., Sarma, A., Doshi, S., and Nair, R. (2018). House price prediction using machine learning and neural networks. In 2018 second international conference on inventive communication and computational technologies (ICICCT) (pp. 1936-1939). IEEE. [DOI:10.1109/ICICCT.2018.8473231]
27. Wang, W.C., Chang, Y.J., and Wang, H.C. (2019). An application of the spatial autocorrelation method on the change of real estate prices in Taitung City. ISPRS International Journal of Geo-Information, 8(6),249. [DOI:10.3390/ijgi8060249]
Farahzadi M, Farnoosh R, Behzadi M H. Machine Learning Models for Housing Prices Forecasting using Registration Data. JSRI 2020; 17 (1) :191-214 URL: http://jsri.srtc.ac.ir/article-1-420-en.html