Search Results for author: Misha Sra

Found 14 papers, 7 papers with code

Engaging with AI: How Interface Design Shapes Human-AI Collaboration in High-Stakes Decision-Making

no code implementations28 Jan 2025 Zichen Chen, Yunhao Luo, Misha Sra

In many existing human + AI systems, decision-making support is typically provided in the form of text explanations (XAI) to help users understand the AI's reasoning.

Decision Making

Everyday AR through AI-in-the-Loop

no code implementations17 Dec 2024 Ryo Suzuki, Mar Gonzalez-Franco, Misha Sra, David Lindlbauer

This workshop brings together experts and practitioners from augmented reality (AR) and artificial intelligence (AI) to shape the future of AI-in-the-loop everyday AR experiences.

LocoVR: Multiuser Indoor Locomotion Dataset in Virtual Reality

1 code implementation9 Oct 2024 Kojiro Takeyama, Yimeng Liu, Misha Sra

Understanding human locomotion is crucial for AI agents such as robots, particularly in complex indoor home environments.

Social Navigation

SiCo: A Size-Controllable Virtual Try-On Approach for Informed Decision-Making

1 code implementation5 Aug 2024 Sherry X. Chen, Alex Christopher Lim, Yimeng Liu, Pradeep Sen, Misha Sra

Based on our evaluation, we believe our VTO design has the potential to reduce return rates and enhance the online clothes shopping experience.

Decision Making Virtual Try-on

AID-AppEAL: Automatic Image Dataset and Algorithm for Content Appeal Enhancement and Assessment Labeling

1 code implementation8 Jul 2024 Sherry X. Chen, Yaron Vaxman, Elad Ben Baruch, David Asulin, Aviad Moreshet, Misha Sra, Pradeep Sen

We propose Image Content Appeal Assessment (ICAA), a novel metric that quantifies the level of positive interest an image's content generates for viewers, such as the appeal of food in a photograph.

GraphEval2000: Benchmarking and Improving Large Language Models on Graph Datasets

no code implementations23 Jun 2024 Qiming Wu, Zichen Chen, Will Corcoran, Misha Sra, Ambuj K. Singh

To address this gap, we introduce GraphEval2000, the first comprehensive graph dataset, comprising 40 graph data structure problems along with 2000 test cases.

Benchmarking

Artificial Leviathan: Exploring Social Evolution of LLM Agents Through the Lens of Hobbesian Social Contract Theory

no code implementations20 Jun 2024 Gordon Dai, Weijia Zhang, Jinhan Li, Siqi Yang, Chidera Onochie lbe, Srihas Rao, Arthur Caetano, Misha Sra

We analyze whether, as the theory postulates, agents seek to escape a brutish "state of nature" by surrendering rights to an absolute sovereign in exchange for order and security.

TiNO-Edit: Timestep and Noise Optimization for Robust Diffusion-Based Image Editing

1 code implementation CVPR 2024 Sherry X. Chen, Yaron Vaxman, Elad Ben Baruch, David Asulin, Aviad Moreshet, Kuo-Chin Lien, Misha Sra, Pradeep Sen

Previous approaches have focused on either fine-tuning pre-trained T2I models on specific datasets to generate certain kinds of images (e. g., with a specific object or person), or on optimizing the weights, text prompts, and/or learning features for each input image in an attempt to coax the image generator to produce the desired result.

XplainLLM: A Knowledge-Augmented Dataset for Reliable Grounded Explanations in LLMs

1 code implementation15 Nov 2023 Zichen Chen, Jianda Chen, Ambuj Singh, Misha Sra

Large Language Models (LLMs) have achieved remarkable success in natural language tasks, yet understanding their reasoning processes remains a significant challenge.

Decision Making Decoder +6

LMExplainer: Grounding Knowledge and Explaining Language Models

no code implementations29 Mar 2023 Zichen Chen, Jianda Chen, YuanYuan Chen, Han Yu, Ambuj K Singh, Misha Sra

By comparing the explanations generated by LMExplainer with those of other models, we show that our approach offers more comprehensive and clearer explanations of the reasoning process.

Decision Making Graph Attention

SPOTR: Spatio-temporal Pose Transformers for Human Motion Prediction

no code implementations11 Mar 2023 Avinash Ajit Nargund, Misha Sra

Our contributions are threefold: (i) we frame human motion prediction as a sequence-to-sequence problem and propose a non-autoregressive Transformer to forecast a sequence of poses in parallel; (ii) our method is activity agnostic; (iii) we show that despite its simplicity, our approach is able to make accurate predictions, achieving better or comparable results compared to the state-of-the-art on two public datasets, with far fewer parameters and much faster inference.

Autonomous Driving Human motion prediction +1

Self-Supervised Knowledge Assimilation for Expert-Layman Text Style Transfer

1 code implementation6 Oct 2021 Wenda Xu, Michael Saxon, Misha Sra, William Yang Wang

This is a particularly notable issue in the medical domain, where layman are often confused by medical text online.

Language Modelling Self-Supervised Learning +2

Txt2Vid: Ultra-Low Bitrate Compression of Talking-Head Videos via Text

1 code implementation26 Jun 2021 Pulkit Tandon, Shubham Chandak, Pat Pataranutaporn, Yimeng Liu, Anesu M. Mapuranga, Pattie Maes, Tsachy Weissman, Misha Sra

Video represents the majority of internet traffic today, driving a continual race between the generation of higher quality content, transmission of larger file sizes, and the development of network infrastructure.

Talking Face Generation Talking Head Generation +2

Cannot find the paper you are looking for? You can Submit a new open access paper.