Search Results for author: Jonathan Stray

Found 4 papers, 0 papers with code

Summon a Demon and Bind it: A Grounded Theory of LLM Red Teaming in the Wild

no code implementations • 10 Nov 2023 • Nanna Inie, Jonathan Stray, Leon Derczynski

As a result, this paper presents a grounded theory of how and why people attack large language models: LLM red teaming in the wild.

Paper
Add Code

Building Human Values into Recommender Systems: An Interdisciplinary Synthesis

no code implementations • 20 Jul 2022 • Jonathan Stray, Alon Halevy, Parisa Assar, Dylan Hadfield-Menell, Craig Boutilier, Amar Ashar, Lex Beattie, Michael Ekstrand, Claire Leibowicz, Connie Moon Sehat, Sara Johansen, Lianne Kerlin, David Vickrey, Spandana Singh, Sanne Vrijenhoek, Amy Zhang, McKane Andrus, Natali Helberger, Polina Proutskova, Tanushree Mitra, Nina Vasan

We collect a set of values that seem most relevant to recommender systems operating across different domains, then examine them from the perspectives of current industry practice, measurement, product design, and policy approaches.

Causal Inference Ethics +1

Paper
Add Code

What are you optimizing for? Aligning Recommender Systems with Human Values

no code implementations • 22 Jul 2021 • Jonathan Stray, Ivan Vendrov, Jeremy Nixon, Steven Adler, Dylan Hadfield-Menell

We describe cases where real recommender systems were modified in the service of various human values such as diversity, fairness, well-being, time well spent, and factual accuracy.

Fairness Recommendation Systems

Paper
Add Code

Designing Recommender Systems to Depolarize

no code implementations • 11 Jul 2021 • Jonathan Stray

Polarization is implicated in the erosion of democracy and the progression to violence, which makes the polarization properties of large algorithmic content selection systems (recommender systems) a matter of concern for peace and security.

Recommendation Systems

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.