Omnicom Media Group (OMG) is a marketing and advertising company. Our client is Annalect, a solutions provider that helps OMG's marketers make data actionable. OMG has processed cookie data that helps its marketers set up advertisement campaigns; it buys this data from a third party. It also manages a vast amount of cookie data itself, part of which is currently unused. Before the marketers can use this data, Annalect needs to process it and prepare database views for them. Because the marketers have no database knowledge, Annalect creates dashboards with third-party software called Tableau so that they can manipulate these views. Annalect asked us to create an application in which it can set up these dashboards for the marketers, so that they can manipulate the cookie data and use it for their advertisement campaigns.
As a result, we have created a web application that supports the marketer's workflow. After some setup by Annalect, a marketer can sign in to our application, choose a dashboard, work with the cookie data, and send the manipulated data off to create an advertisement campaign, all without leaving the application.
The data that marketers manipulate in our application is only a small subset of the complete data set. Because the complete data set is much larger, it has to be processed on server clusters paid for by Annalect. This processing runs at night to reduce the cost of running the cluster. After the processing, the result has to be sent to Google DoubleClick Campaign Manager. Annalect also requested the ability to use machine learning algorithms. We implemented this as a generic pipeline that supports multiple machine learning models, and included a model based on gradient boosting as a proof of concept.
We evaluated the application in several ways, since different aspects call for different tests. First, a usability test was performed with end users to evaluate the user interface. Second, unit tests were written where applicable. Finally, the machine learning model was evaluated using precision and recall together with K-fold cross-validation.
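
To make the last evaluation step concrete, the following is a minimal sketch of K-fold cross-validation that reports precision and recall per fold. It is not the project's actual code: the function names, the toy data, and the simple threshold classifier (a stand-in for the gradient boosting model) are all assumptions made for illustration.

```python
from typing import Callable, List, Sequence, Tuple


def k_fold_indices(n: int, k: int) -> List[Tuple[List[int], List[int]]]:
    """Split range(n) into k contiguous folds; return (train, test) index lists."""
    if not 2 <= k <= n:
        raise ValueError("k must satisfy 2 <= k <= n")
    # The first n % k folds get one extra element so that all n items are used.
    fold_sizes = [n // k + (1 if i < n % k else 0) for i in range(k)]
    splits, start = [], 0
    for size in fold_sizes:
        test = list(range(start, start + size))
        train = [i for i in range(n) if i < start or i >= start + size]
        splits.append((train, test))
        start += size
    return splits


def precision_recall(y_true: Sequence[int], y_pred: Sequence[int]) -> Tuple[float, float]:
    """Precision and recall for binary labels, with 1 as the positive class."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


def fit_threshold(xs: Sequence[float], ys: Sequence[int]) -> Callable[[float], int]:
    """Hypothetical stand-in model: threshold halfway between the class means.

    Assumes the training data contains at least one example of each class.
    """
    pos = [x for x, y in zip(xs, ys) if y == 1]
    neg = [x for x, y in zip(xs, ys) if y == 0]
    threshold = (sum(pos) / len(pos) + sum(neg) / len(neg)) / 2
    return lambda x: 1 if x >= threshold else 0


def cross_validate(xs: Sequence[float], ys: Sequence[int], k: int) -> List[Tuple[float, float]]:
    """Train on k-1 folds, score on the held-out fold, and repeat for every fold."""
    scores = []
    for train, test in k_fold_indices(len(xs), k):
        model = fit_threshold([xs[i] for i in train], [ys[i] for i in train])
        preds = [model(xs[i]) for i in test]
        scores.append(precision_recall([ys[i] for i in test], preds))
    return scores


if __name__ == "__main__":
    # Toy data invented for this sketch: one feature, binary label.
    xs = [0.1, 0.9, 0.2, 0.8, 0.3, 0.7, 0.4, 0.6]
    ys = [0, 1, 0, 1, 0, 1, 0, 1]
    for fold, (p, r) in enumerate(cross_validate(xs, ys, k=4), start=1):
        print(f"fold {fold}: precision={p:.2f} recall={r:.2f}")
```

The sketch uses contiguous folds for simplicity; with real data the examples would normally be shuffled or stratified before splitting, so that every fold contains both classes.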
Several aspects of the application raise ethical concerns. Vast amounts of cookie data must be managed discreetly, because personal information can be derived from such data, and a leak can harm the individuals who supplied it. In addition, the right to explanation law that comes into effect next year will force companies to explain why their computer models made certain decisions or classifications, which has implications for the machine learning models used by our application. Finally, users of the application have to be aware that they are working with sensitive data about individuals and must act accordingly.