Hello everyone. I have an idea for a real-time personalization engine for e-commerce, and to build it I need real traffic data in large volumes. I would be very grateful if someone could send me their logs.
Any e-commerce logs will do, as long as something can be learned from them and demonstrable conclusions can be drawn. For example, if a customer browses only one section or views products from only one group, they should be categorized as “fans of this group”.
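To make that example concrete, here is a rough Python sketch of the categorization. The `(visitor_id, group)` event shape, the function name `favorite_groups`, and the `min_views` cutoff are my own assumptions for illustration, not a fixed design.

```python
from collections import Counter, defaultdict


def favorite_groups(events, min_views=3):
    """Map each visitor to the one product group they browse, if any.

    `events` is an iterable of (visitor_id, group) pairs (assumed shape).
    A visitor is labeled only when every view falls in a single group
    and there are at least `min_views` of them (assumed cutoff).
    """
    views = defaultdict(Counter)
    for visitor_id, group in events:
        views[visitor_id][group] += 1

    favorites = {}
    for visitor_id, counts in views.items():
        if len(counts) == 1:
            group, n = next(iter(counts.items()))
            if n >= min_views:
                favorites[visitor_id] = group
    return favorites


# Hypothetical sample data: one visitor sticks to sports, the other does not.
sample = [
    ("10.0.0.1", "sports"), ("10.0.0.1", "sports"), ("10.0.0.1", "sports"),
    ("10.0.0.2", "sports"), ("10.0.0.2", "garden"), ("10.0.0.2", "sports"),
]
print(favorite_groups(sample))  # {'10.0.0.1': 'sports'}
```

A real version would probably tolerate a few stray views in other groups, but the strict "only one group" test matches the example as stated.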
The only complication is that most logs contain no customer identifier. They do contain IP addresses, which will do for now, but if anyone has logs with customer IDs, that would be perfect.
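As a sketch of how an IP could stand in for the customer ID, the snippet below pulls the IP and the section out of one access-log line. It assumes the logs are in Common Log Format and that the first URL path segment names the section (for example `/sports/...`); both are assumptions, and real logs may need a different pattern. The sample line is made up.

```python
import re

# Common Log Format: host ident authuser [date] "request" status bytes
LOG_LINE = re.compile(
    r'^(?P<ip>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<path>\S+)[^"]*" (?P<status>\d{3}) \S+'
)


def parse_line(line):
    """Return (visitor_id, section), or None if the line is unusable."""
    m = LOG_LINE.match(line)
    if m is None or m.group("method") != "GET" or m.group("status") != "200":
        return None
    # Assumption: the first path segment is the section, e.g. /sports/shoes-123.
    path = m.group("path").split("?", 1)[0]
    segments = [s for s in path.split("/") if s]
    if not segments:
        return None
    # The IP is used as the visitor key until real customer IDs are available.
    return m.group("ip"), segments[0]


line = ('203.0.113.7 - - [10/Oct/2015:13:55:36 +0000] '
        '"GET /sports/shoes-123?ref=home HTTP/1.1" 200 2326')
print(parse_line(line))  # ('203.0.113.7', 'sports')
```

An IP is only a rough proxy, since several customers can share one address and one customer can change addresses, which is why logs with real customer IDs would be better.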
Ideally, I would also like to get data on orders. I don’t need surnames, first names, emails, or addresses. Even the names of the products are not necessary.
The end result will be a prototype system that receives messages from the e-commerce site (in the simplest case, log lines) and immediately applies rules to them in real time. I plan to use the Drools engine, with rules such as: “[when] the customer has visited 10 pages from the sports section, [then] they should be shown a sports banner.” The decision that customer X should be shown this banner is sent back to the site, and the banner is displayed the next time a page with banners loads.
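The flow described above could be sketched roughly as follows. This is plain Python standing in for the rule engine, not Drools and not DRL syntax; the class names `BannerRule` and `RuleProcessor` and the banner name `sports-banner` are hypothetical, chosen only to illustrate the event-in, decision-out loop.

```python
from collections import defaultdict


class BannerRule:
    """[when] a visitor has viewed `threshold` pages in `section`,
    [then] they should be shown `banner`."""

    def __init__(self, section, threshold, banner):
        self.section = section
        self.threshold = threshold
        self.banner = banner


class RuleProcessor:
    def __init__(self, rules):
        self.rules = rules
        self.page_views = defaultdict(int)  # (visitor_id, section) -> count
        self.banners = defaultdict(set)     # visitor_id -> banners to show

    def on_event(self, visitor_id, section):
        """Handle one page-view event; return banners newly assigned by it."""
        self.page_views[(visitor_id, section)] += 1
        count = self.page_views[(visitor_id, section)]
        fired = []
        for rule in self.rules:
            already_set = rule.banner in self.banners[visitor_id]
            if rule.section == section and count >= rule.threshold and not already_set:
                self.banners[visitor_id].add(rule.banner)
                fired.append(rule.banner)
        return fired

    def banners_for(self, visitor_id):
        """What the site would receive on the next page load with banners."""
        return sorted(self.banners.get(visitor_id, ()))


processor = RuleProcessor([BannerRule("sports", 10, "sports-banner")])
for _ in range(9):
    processor.on_event("X", "sports")
print(processor.banners_for("X"))         # []
print(processor.on_event("X", "sports"))  # ['sports-banner']
print(processor.banners_for("X"))         # ['sports-banner']
```

Each rule fires once per visitor, on the event that crosses the threshold, so the site only needs to look up the stored decision when it renders a page.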
No such system exists yet, but I am sure I can build a prototype over a weekend or slightly longer. My concern is performance: I need to test all this at large volumes and with a high frequency of events. That is why I need the logs.
I will publish the results on a blog about hybris/e-commerce. Whether I disclose your data or not is up to you. Related posts already on the site include one on a recommendation system and three articles about Drools in e-commerce.
