The workshop on Personalization, Recommendation and Search (PRS) aims at bringing together practitioners and researchers in these three domains. The goal of this workshop is to facilitate the sharing of information and practices, as well as finding bridges between these communities and promoting discussion.
Please register in advance through the RSVP button above. We'll close registrations on June 1st or when we reach capacity.
If you are interested in presenting a poster at the workshop, please fill out this form before May 22nd.
The event will be in-person, at our Netflix campus in Los Gatos, CA.
This @NetflixResearch workshop is organized by:
  Justin Basilico - jbasilico[at]netflix.com
  Linas Baltrunas - lbaltrunas[at]netflix.com
  Grace Huang - ghuang[at]netflix.com
  Aish Fenton - afenton[at]netflix.com
  Sudarshan Lamkhede - slamkhede[at]netflix.com
  Patric Glynn - pglynn[at]netflix.com
Previous PRS workshops: 2022, 2021, 2019, 2018, 2017, 2016.
Jiajing Xu heads the Applied Science team at Pinterest that brings cutting-edge solutions to the company’s most pressing challenges. The team tackles challenges in the areas of representation learning, recommendation system, graph neural network, ML fairness, and inclusive AI. Prior to his current role, he co-founded the visual discovery team at Pinterest and managed the Ads Ranking team. He holds a Ph.D. and a Master’s degree from Stanford University, and Bachelor’s degree from California Institute of Technology.
 Sakshi Jain leads the Data efforts in Responsible AI @ LinkedIn. Sakshi is spearheading the definition and operationalization of Responsible AI principles like fairness, transparency and inclusivity across major AI products @ LinkedIn. Prior to this, she has spent 6+ years working at the intersection of Trust & AI, protecting LinkedIn's products against large scale adversarial attacks.
Dr. Hongning Wang is now the Copenhaver Associate Professor in the Department of Computer Science at the University of Virginia. He received his PhD degree in computer science at the University of Illinois at Champaign-Urbana in 2014. His research generally lies in the intersection among machine learning, data mining and information retrieval, with a special focus on sequential decision optimization and computational user modeling. His work has generated over 80 research papers in top venues in data mining and information retrieval areas. He is a recipient of 2016 National Science Foundation CAREER Award, 2020 Google Faculty Research Award, and SIGIR’2019 Best Paper Award.
Maryam Esmaeili is a Research Scientist at Netflix, where she specializes in solving problems related to personalization and recommendation systems. She earned her PhD in Computer Science from the University of Massachusetts in Amherst and the Dalle Molle Institute for Artificial Intelligence Research (IDSIA) in Switzerland. Before her current position at Netflix, Maryam worked at Amazon, where she focused on developing recommendation systems for Amazon Prime.
Yesu Feng is a Research Scientist at Netflix, his current work focuses on personalized video recommendation and large scale sequential recommendation. He got his PhD in Physics from Duke University. He previously worked at Linkedin, where he focused on homepage feed recommendation. Before joining Netflix, he was at Uber, where his work focused on driver behavior and supply prediction.Â
Â
Minmin Chen is a Research Scientist in Google. She received her PhD from Washington University in St. Louis. Her main research interests are in reinforcement learning and bandits algorithms and their applications in recommender systems. She is passionate about bringing latest ML technologies to real life, especially to transform recommendation experiences. She publishes at ML and RecSys conferences, and regularly serves as area chair or senior program committee for NeurIPS, ICML and ICLR.
Chen Karako is a Senior Data Science Manager at Shopify, where she leads the Discovery Experience data team. Chen has focused on building search and recommendation products, experimenting and running A/B tests to improve and measure feature impact, and collaborating with cross-disciplinary teams to bring personalization to Shopify merchants and their customers. Chen is also interested in fairness in AI and has published research in this domain. Chen obtained an M.Sc. in Astrophysics from McGill University, where she discovered 30 radio pulsars by developing signal processing algorithms for telescope data.
Dawen Liang is a research scientist at Netflix, working on core personalization algorithms. His research interests include probabilistic models and (approximate) inference, causal inference, reinforcement learning, and their applications to recommender systems. He completed his Ph.D. in the Electrical Engineering Department at Columbia University, working on probabilistic latent variable models for analyzing music, speech, text, and user behavior data.Â
Anoop Deoras is the head of AI/ML applied sciences organization overseeing applied research in 5 core areas: Generative AI Technologies behind Code — CodeWhisperer, Recommender System as a service — Amazon Personalize, Forecasting and Anomaly Detection for IT/Dev/AI-Ops as a service — Amazon Forecast and Amazon DevOpsGuru and Pytorch framework support and applications for the Amazon AWS Trainium and Inferentia Machine Learning accelerators. Anoop holds a PhD from Johns Hopkins and has interests in fundamental research in and application of timeseries models esp. language (large or small) models in areas as diverse as recommender systems, ad targeting and forecasting.
Abstract:
Pinterest’s mission is to give everyone the inspiration to create a life that they love. We achieve this by generating highly personalization recommendations along users’ journey on Pinterest. Over the years, Pinterest users also curated one of the largest graph-structured data on the Internet. In this talk, we will walk through the unique challenges in the recommendation system at Pinterest, and how we leverage this graph to make some latest development in these real-world applications. We will also showcase how we built the system that has been deployed in production and has delivered significantly better user experience across organic and Ads feeds.Â
Abstract:
Operationalizing AI fairness at LinkedIn’s scale is challenging not only because there are multiple mutually incompatible definitions of fairness but also because determining what is fair depends on the specifics and context of the product where AI is deployed. Moreover, AI practitioners need clarity on what fairness expectations need to be addressed at the AI level. In this work, I'll present the evolving AI fairness framework used at LinkedIn to address these three challenges. The framework disentangles AI fairness by separating out equal treatment and equitable product expectations. Rather than imposing a trade-off between these two commonly opposing interpretations of fairness, the framework provides clear guidelines for operationalizing equal AI treatment complemented with a product equity strategy. I'll present the principles we set forth and cover a real-world case study walking us through how we measure and mitigate for algorithmic bias.Â
Title: How Bad is Top-K Recommendation under Competing Content Creators?  Â
Speaker: Hongning Wang (University of Virginia)
Abstract:
Content creators compete for exposure on recommendation platforms, and such strategic behavior leads to a dynamic shift over the content distribution. However, how the creators' competition impacts user welfare and how the relevance-driven recommendation influences the dynamics in the long run are still largely unknown.
Our recent work provides theoretical insights into these research questions. We model the creators' competition under the assumptions that: 1) the platform employs an innocuous top-K recommendation policy; 2) user decisions follow the Random Utility model; 3) content creators compete for user engagement and, without knowing their utility function in hindsight, apply arbitrary no-regret learning algorithms to update their strategies. We study the user welfare guarantee through the lens of Price of Anarchy and show that the fraction of user welfare loss due to creator competition is always upper bounded by a small constant depending on K and randomness in user decisions; we also prove the tightness of this bound. Our result discloses an intrinsic merit of the myopic approach to the recommendation, i.e., relevance-driven matching performs reasonably well in the long run, as long as users' decisions involve randomness and the platform provides reasonably many alternatives to its users.
Title: ALLY: Large Language Models for Assistive Recommendations.   Â
Speaker: Minmin Chen (Google)
Abstract:
Large language models (LLMs) have shown unmatched capacities in language understanding and generation, prompting surges of interest in its adoption for recommendations. In this talk, I will describe our long quest to pinpoint its ability in natural language understanding and generation to facilitate communication between recommendation agents and users, building trust and enabling user agency on recommendation platforms. I will discuss two lines of work 1) user study on how users rely on recommendation systems to pursue their interest journeys and different techniques to prompt LLMs to provide nuanced and interesting user journey descriptions to assist users in their real life journeys; 2) prompting LLMs to provide different formats of explanations on why a recommendation is made, and improve recommendation with the extracted explanations. The two combined can offer users a glimpse into the recommender agents' competence to build trust, as well as enable users to navigate and control their recommendations. I will discuss research findings discovered along these endeavors and brainstorm challenges and opportunities in integrating LLMs into user-facing products.Â
Title: Optimizing product discovery for millions of merchants.   Â
Speaker: Chen Karako (Shopify)
Abstract:
At Shopify, we optimize product discovery for millions of merchants’ online stores, through product recommendations and search. It’s a challenge to provide good recommendations to all of these independent and diverse businesses; a given model might lead to an overall improvement in metrics on aggregate, but degrade the performance for some stores. Moreover, since each store is unique and has a different customer base, design, product inventory, and data quality, we must take care when running experiments to evaluate the impact of our models. In this talk, we describe the unique challenges involved in improving product discovery for millions of merchants and their customers, and the approaches we employ for model development, offline evaluation, and experimentation.
Title: Zero Shot Recommenders, Large Language Models and Prompt Engineering.  Â
Speaker: Anoop Deoras (Amazon)
 Abstract:
Large scale deep learning recommender system models, which are used for predicting top-k items given a query (typically a user identifier), learn the user to item propensities from large amounts of training data containing many (user, item, time) tuples. Unlike the simpler bi-linear models such as Matrix Factorization, deep learning recommender system models almost always do not model users explicitly bur rather model them via the items they have consumed in the past, with or without temporal ordering information. This allows the model to extrapolate the preference matching on many unseen users as long as the items these unseen users have interacted with are seen during training. However, pure item identifier based models are often incapable in assigning faithful propensities to novel items not previously seen during training. This is the classical item cold start problem in recommender system. Item content features (e.g. item description, movie synopsis, product image, etc) become useful in such cold start scenarios as they become proxy for the item identifier. In this talk, we investigate a rather extreme cold start situation wherein the entire application is treated as cold start. Imagine two related businesses whose product inventory is similar in content type but not in identity. Further assume that one of them lacks sufficient user-item interactions limiting us from building a personalization model. Question we want to ask is whether we can obtain domain agnostic information from the data rich domain — source domain, and transfer it to the data poor domain — target domain ? To this end, in this talk, we present Zero Shot Transfer Learning model that transfers such learned task-agnostic-knowledge from the source to the target domain. We conduct offline experiments on large scale real world datasets and present the results and findings. Time permitting, we will also talk about a novel concept we first introduced back in 2021 — Language Models as Recommender Systems, where we show how we can leverage large pre-trained language models and prompt engineering to generate recommendations in a zero shot setting.Â
Abstract:
Recommender systems seek to answer a policy optimization problem: given data collected from the previously-deployed recommendation policy, our goal is to come up with a new policy that can achieve better outcome when deployed online. However, the optimization for a better policy is counterfactual in nature. The conventional way to address this challenge is through importance sampling correction, which comes with its own practical limitations. In this presentation, we suggest an alternative approach of local policy improvement and present an objective function that is simple to estimate from data without off-policy correction. We discuss the tradeoff between this approach and importance sampling correction and offer some practical arguments for favoring one over the other.
Title: Netflix's Personalization Power-Ups: Using a Foundation Model and Precision-Based Adaptation for Tailored Content Recommendations.   Â
Speakers: Maryam Esmaeili (Netflix) &Â Yesu Feng (Netflix)
Abstract:
At Netflix, personalization is a top priority in the constantly evolving entertainment industry, and we strive to use our insights from a member’s streaming data to enhance their viewing experience. To achieve this, we employ a two-stage process, beginning with a Foundation Model to understand users' long-term preferences. We then use precision-based adaptation to fine-tune it, ensuring that the recommendations align with our subscribers' preferences and needs.Â
The Foundation Model is specifically designed to capture users' long-term preferences and provide accurate representations of Netflix's content. We use large-scale self-supervised learning to analyze users' interaction histories and gain insight into their interests and the evolution of their tastes over time. In this talk, we will explore the similarities and differences between our Foundation Model and large language models, and discuss the challenges we have overcome, such as addressing the cold-start problem and mitigating popularity bias.
We will also discuss how the Foundation Model is utilized by the downstream canvas-serving page construction model. This is achieved by providing embeddings or performing fine-tuning based on business objectives. To optimize the personalized page, we take into account user-specific feedback and contextual features such as time of day, device type, and in-session user signals. Our ultimate goal is to create a personalized page that strikes a perfect balance between factors including user preferences, diversity, and novelty. We achieve this through our page optimization model, which employs user-specific data and contextual information to generate highly personalized recommendations. We continually refine our model by training it with context-specific data to ensure that our recommendations remain current and relevant.
Dynamic fairness-aware recommendation through multi-agent social choice [poster]
Authors: Amanda Aird, Paresha Farastu, Joshua Sun, Amy Voida, Nicholas Mattei, Robin Burke
Information Coverage in Various Knowledge Sources [poster]
Authors: Sneha Singhania, Simon Razniewski, Gerhard Weikum
Scaling Artwork Personalization with Bandits [poster]
Authors: Ronica Jethwa, Pulkit Aggarwal ,Lian Liu, Fei Xiao, Abhishek Bambha, Dan Meropol, Rohit Mahto
 Cold Starting Heterogeneous Entities in Netflix Homepage [poster]
Authors: Ding Tong, Erik Schmidt, Ehsan Saberian, Emma Kong, Raveesh Bhalla, Justin Basilico
Near-Minimax-Optimal Risk-Sensitive Reinforcement Learning with CVaR [poster]
Authors: Kaiwen Wang, Nathan Kallus, Wen Sun
ContrastVAE: Contrastive Variational AutoEncoder for Sequential Recommendation [poster]
Authors: Yu Wang, Hengrui Zhang, Zhiwei Liu, Liangwei Yang, Philip S. Yu
 Implicit Session Contexts for Next-Item Recommendations [poster]
Authors: Sejoon Oh
S3GC: Scalable Self-Supervised Graph Clustering [poster]
Authors: Devvrit Fnu, Aditya Sinha, Inderjit Dhillon, Prateek Jain
Probabilistic Conformal Prediction Using Conditional Random Samples [poster]
Authors: Zhendong Wang, Ruijiang Gao, Mingzhang Yin, Mingyuan Zhou, David M. Blei
Variational latent branching model for off-policy evaluation [poster]
Authors: Qitong Gao
Ruoxi is a senior software engineer in Google Brain, focusing on fundamental deep learning research and its applications in recommenders, especially on learning better feature interactions and memory and computational efficient models. She also works very closely across teams in Google's major organic and Ads products to put her research into practice. Ruoxi received her Ph.D. from computational mathematics at Stanford University, where her research interests are numerical linear algebra, randomized algorithms and machine learning. When she's not thinking about how math can improve deep learning models and Google products, she really enjoys hanging out with her paw friend MeiMei, and has been trying to get better at swimming.