The Czech company Carvago runs a very successful online marketplace for used cars from all over Europe. Til now it offers around 1 million cars to customers from the Czech Republic, Slovakia, and Germany, with other European countries to follow in the near future.
In order to achieve these goals, each day Carvago needs to find used cars on offer all across Europe and include these in their offer.
To ensure their customers find the best deal for the car they want at any time, Carvago needs to constantly update its database with newly offered cars on all portals around Europe. Often their providers do not fill in the details of the car they are offering, or may place an advert on more than one portal at the same time.
So besides retrieving new listings, the system needs to detect and remove duplicates, correct incorrect information, and classify all offers on the basis of an exhaustive catalogue of car models, including key parameters such as motor and gearbox type, drivetrain, et al.
The main requirements for the solution were as follows
We analyzed in detail the various existing sources of information on cars and advertisements, and identified:
Interesting fact: 10 car models cover 37% of the market.
Revolt.BI's solution for Carvago comprises several components
We chose Keboola as our Data Warehouse and DevOps platform, based on Keboola's excellent computing power, integration of all necessary services, diagnostics of all processes and many other advantages. For data storage we settled on Snowflake.
For automated photo analysis & image recognition we use deep learning - a convolutional neural network (CNN) - which is able to identify objects and many other types of features in an image and draw conclusions by analyzing them at low cost, thanks to a unique set of efficient algorithms and technologies. Our image analytics solution is also able to correct erroneous information - for example recognizing from the car's photo that it's a station wagon, even though the description states it as a VAN or MPV. We can even automatically detect the type of air conditioning from the interior photo!
Interesting fact: To train the neural network well for one model, 2000 photos are needed.
We handle business analytics using Tableau - no other visualization tool could handle so easily the many different views and aspects needed for Carvago's daily operations, not only for the company itself, but also for its business partners.
By working with Revolt.BI, Carvago gained unique and always up-to-date data on European used cars, including relevant parameters and price of each car in question, along with analytical tools to assist in their business use.
Revolt.BI's business analytics enable Carvago's sales department and its customers to make data-driven decisions, such as targeting dealers according to the strength of their segments or making detailed comparisons of the cars on offer across ad servers.