Mix-camp E-commerce Search
26th April 2022 - US Edition
Charlottesville, VA - US Edition
MICES started in Berlin, Germany, in 2017. In 2022, it came to the US for the first time. Participants from a variety of backgrounds met in Charlottesville on the day before the Haystack Conference to share and discuss there ideas and thoughts about e-commerce search. Everyone got involved into the many interactive sessions, helping to turn MICES US into a great success.
You will find the slides and videos from the talks in the Programme section below.
MICES brings together participants from a variety of backgrounds, all sharing a common interest in e-commerce search. In order to stimulate and facilitate discussion, we will start with scheduled talks in the morning. This will be followed by self-organising sessions: participants are encouraged to initiate discussions about their topics of interest or to give an ad hoc presentation.
The workshop welcomes all topics related to e-commerce search. We have compiled a list of topics that will probably feature at the workshop. This list is not comprehensive. Participants are welcome to discuss further topics at the workshop.
Welcome by the organizers and logisticsSlides
Co-Founder of MICES / OpenSource Connections
When we started MICES in 2017, one of our main motivations was to meet with other people who work in the field of e-commerce search. We felt that not much had been understood about search in e-commerce. It didn't even seem to be a domain on its own right, especially compared to search in other domains, such as Enterprise Search or general web search.
Five years on, e-commerce search has become much more recognized. Partly, as there is a growing awareness that search is a key factor to a successful online shopping experience, but probably also because the commodization of machine learning and its application to search seems to promise great business yield.
On the other hand, at least in the open source search engine world, the available implementations for ranking are still the ones that were made for document retrieval and not for products. And search optimization work still seems to focus on search relevance but hardly on consumer experience.
In this talk, I shall present a model for understanding e-commerce search that starts from the consumer perspective and that can serve as guidance for search optimization work.
René works as Director E-commerce at OpenSource Connections, leading the company's e-commerce search team and practice.
He has worked in search for more than 15 years, including on projects for some of the top 10 German e-commerce sites. René is co-founder and co-organiser of MICES. His technological focus is on Solr, Elasticsearch and Lucene. He created and maintains the Querqy open source library for query rewriting and he is co-initiator of the Chorus project – an open source software stack that combines Querqy with other powerful tools to build e-commerce search and to measure and improve search quality.
Doug Turnbull & Chen Karako
Imagine being a carpenter with one hand tied behind your back? That’s how it can feel when building e-commerce search without embracing both data and engineering roles.
In this talk, we share a core principle of Shopify’s search team - data and engineering collaboration. In past lives, we’ve seen what happens if you don’t take this ideal to heart: slow, broken machine learning models developed without engineering input crash and burn, we’ve seen engineers tweak relevance uninformed by experimental data – just because the results seem to ‘look good’.
At Shopify search data and engineering collaborate so fluidly, you would be hard pressed to tell who’s a data scientist and who’s an engineer. Rather than search being an arcane specialty, we’ll argue the best, modern relevance specialists, must actually be ‘professional learners’ - eager to embrace both the data scientist and engineer roles.
We go beyond the philosophical, to the practical, organizational work to build this fluidity – whether it’s how we recruit for our team, ensure an environment of safe collaboration, or handle alignment between our organizations, we will share the nitty-gritty, real-life story of how we arrived at a mature, highly collaborative search organization.
In 2012, Doug saw search relevance would be central to user experiences. Sadly, search relevance was a topic clouded in esoteric mystery. He began to democratize this daunting field through blogging and speaking. In 2016, Doug’s book Relevant Search woke the industry up to the importance of search quality. Doug now works at Shopify, working to make great commerce search+discovery possible for everyone. Doug co-created Elasticsearch Learning to Rank with the Wikimedia Foundation and contributed to Trey Grainger’s upcoming AI Powered Search. You can find Doug at his website where he blogs about search and data.
Chen works as Data Science Lead of the Discovery Experience Data team at Shopify, building search engines and recommendation systems to improve product discovery.
Suggested topics for your inspiration:
Successful online retailers need competitive advantages over their contenders. Owning search is a key to success as it enables you to tailor search results to the buyer’s and seller’s interests. But owning a search is not child’s play. Owning your search means taking care of indexing your data, establishing processes around active search management, providing an offline testing framework, generating judgments, … Search teams need to cover a lot of ground to be successful!
But worry no longer, Chorus is here to support these efforts! Two years ago, Chorus was initially released with Solr as its search engine. Now, it’s also available for Elasticsearch and combines best-in-class open source tools to shorten the amount of time you need to set up an e-commerce search stack based on Elasticsearch.
This session will introduce you to this stack containing production-ready tooling for query rewriting, search management, assessing and tuning search relevance. You will learn how you can use this stack to not only own but also optimize your e-commerce search.
Daniel has worked in search since 2012 and is now part of OpenSource Connections’ e-commerce team where he can follow his passion for search and natural language processing. His experience as a search consultant paved the way for becoming an O’Reilly author co-authoring the first German book on Apache Solr. He enjoys combining multiple open source tools like Apache Zeppelin or Apache NiFi to build powerful search stacks to deliver relevant search. In his free time Daniel supports the local fire brigade as a volunteer firefighter and serves as the sports director of the local shooting club in the village he lives in.
During the day we will ask MICES attendees about the toolstack that they use for e-commerce search and what is the one thing that they like the most about it and what is the one thing that they definitely want it to change. In this short session, we will share the results of the survey.
Principal Search Consultant
Relevance algorithms do not end in themselves. They are developed to serve customers and to convey the offering of a company to them. There is a simple recipe to continuously address the needs of customers: keeping improvement iterations as short as possible while testing as much as possible. Algorithms can simply be considered as a transient state in these iterations. This is why the essence of relevance is the continuous improvement, whereas algorithms and techniques are merely details.
In addition to serving customers, companies require relevance engineering to be organizationally scalable in order to accelerate development on demand, e.g. by implementing more features simultaneously or in shorter time. This has become critically important in the last years as many companies have increased their investments into online retail in response to changes of customers' behavior in the context of the pandemic.
The main reason as to why keeping iterations short and organizational scaling are both very challenging in the context of relevance is the complexity that comes with the diversity of expertise that is required to implement search in general. Furthermore, the open source context has been historically grown in a way that relevance is centralized at a single place. However, the only way to tame these challenges is to introduce well-defined interfaces within and especially across teams in order to strive for an independent and efficient relevance engineering.
The talk will evaluate relevance from an end-to-end perspective (including pre- and post-processing of queries) on the basis of real-world examples. This will include discussions as to why machine learning should always be driven by API-first thinking, how serverless technology is able to accelerate relevance engineering significantly, and how re-ranking helps to achieve a better abstracted relevance architecture.
Johannes is a passionate leader, consultant and engineer in the areas of search, data and cloud. He is experienced in driving and implementing solutions from initial ideas towards systems running in productive environments and measurably contributing to business KPIs.
Throughout his entire career, Johannes has been passionate about open source. In his early days, he contributed several features to Apache NiFi to improve its integration with Apache Solr. Later he became a main contributor and committer for Querqy, a query rewriting solution for Apache Solr and Elasticsearch used by various big retail companies worldwide.
There are auto-completion and auto-completions. We can count at least five different types of use cases and implementations. For most of them, a specific dictionary and dedicated index needs to be constructed. In this presentation, we will show the different possibilities Elasticsearch and Solr offer on different use cases of this search before search that we call auto-completion.
Lucian Precup is CTO of all.site - the collaborative search engine developed at Station F in Paris. With his colleagues at Adelean, Lucian develops solutions for indexing, searching and analyzing data. Lucian regularly shares his knowledge in specialized conferences and organizes the Search & Data Meetup in Paris.
Radu Pop provides Consulting Services as Solutions Architect at Adelean. He handles projects around Elasticsearch and Adelean’s A2 search technology. He oversees the integration and evolution of search engines within large e-commerce platforms, marketplaces or organizations' data lakes. Prior to joining Adelean, Radu acquired a solid experience in Web archiving, operating large scale crawling systems in the context of several European research projects. He holds a PhD in Computer Science and a MSc in Distributed Systems.
We would like to invite you to share your case study, insights, ideas, or research related to e-commerce search with other participants at MICES. You may submit a proposal for a talk as part of the scheduled sessions, which take place in the morning and in the early afternoon. As the number of scheduled talks is limited, you might also consider pitching your topic during the Open Space (Barcamp) sessions which take place in the afternoon.
We especially welcome women working in e-commerce search to propose a talk.
Please submit your proposal for a scheduled talk following one of the (external) links below. Talks will be limited to 30 minutes, including discussion. We will not accept sales talks or product promotions.
Suggest your talk for MICES Europe, Berlin, 15th June
- This call for talks ends on Thuesday, 9th May
René works Director E-commerce at OpenSource Connections. As a search consultant he has supported clients in Germany and abroad for 15 years. Although he is interested in all aspects of search, key areas include search relevance consulting, e-commerce search and Apache Solr/Lucene. René maintains the Querqy open source library. He co-founded MICES in 2017 together with Paul Bartusch and Isabell Drost-From.
Paul is a passionate and visionary product owner with 15+ years experience in development & management of digital products. Coming from an engineering background, he has been working in various industries (e-commerce, search, music, IoT), teaching at university and been active in the open source community. As the founder of productful, Paul is a strong believer that data - together with foundational UX insights - will carve out meaningful ∓ superior digital products. Paul co-founded MICES in 2017 together with René Kriegler and Isabell Drost-From.
MICES is partnering with 'Haystack - The Search Relevance Conference'. MICES US takes place on the day before the Haystack conference. See here for information.