Automating Valuations for Real-Estate

More Info
expand_more

Abstract

As GeoPhy is developing its business model and looking into the future of automated valu- ation models (AVM), this project delivers a proof of concept of a system that automates the training, maintaining, and delivery of machine learning models for automated valuations. In order to achieve this goal, the situation and problem were first analysed. This resulted in an outline of the desired product and requirements in the form of a MoSCoW analysis. An important goal for this project was to incorporate streams of data from a stream processing platform (Apache Kafka) into a service that would train and update models automatically. The second goal for this project was to keep track of the changes in the data in order to detect significant changes in distribution (concept drift) of the target prediction value.

These subjects were studied in literature, reviewing existing and upcoming valuation prac- tices in real-estate, steps needed to perform machine learning tasks, architecture to support big data processing, and concept drift. This resulted in a design made up of four different components: An ETL and data processing component, a modelling component, a Kafka con- nector, and a client-facing API. An important part to ensure efficiency and scalability of the system is the implementation of concept drift: models are only retrained when the distribu- tion of the target training value has changed significantly.

These components use storage in the form of a Postgres database, disk storage and Elastic Search logs. The logs (on model performance and concept drift usage) can be interpreted through a Grafana dashboard, which is editable through its own GUI.

Finally, to test the success of the project, a testing plan was set up and the code was reviewed by an external group (SIG). The code achieved all the testing milestones and received a 4.5/5 in a mid-development review on maintainability. With this project, the concept of automated valuation models inside GeoPhy’s new architecture has been tested and proved and the project is ready to be further developed and used in practice.