Search Results for author: Boyi Li

Found 25 papers, 11 papers with code

An All-in-One Network for Dehazing and Beyond

2 code implementations • 20 Jul 2017 • Boyi Li, Xiulian Peng, Zhangyang Wang, Jizheng Xu, Dan Feng

This paper proposes an image dehazing model built with a convolutional neural network (CNN), called All-in-One Dehazing Network (AOD-Net).

Image Dehazing object-detection +2

Paper
Code

End-to-End United Video Dehazing and Detection

no code implementations • 12 Sep 2017 • Boyi Li, Xiulian Peng, Zhangyang Wang, Jizheng Xu, Dan Feng

Furthermore, we build an End-to-End United Video Dehazing and Detection Network(EVDD-Net), which concatenates and jointly trains EVD-Net with a video object detection model.

Image Dehazing object-detection +1

Paper
Add Code

AOD-Net: All-In-One Dehazing Network

1 code implementation • ICCV 2017 • Boyi Li, Xiulian Peng, Zhangyang Wang, Jizheng Xu, Dan Feng

This paper proposes an image dehazing model built with a convolutional neural network (CNN), called All-in-One Dehazing Network (AOD-Net).

Ranked #20 on Image Dehazing on SOTS Outdoor

Image Dehazing object-detection +2

1,360

Paper
Code

Benchmarking Single Image Dehazing and Beyond

1 code implementation • 12 Dec 2017 • Boyi Li, Wenqi Ren, Dengpan Fu, DaCheng Tao, Dan Feng, Wen-Jun Zeng, Zhangyang Wang

We present a comprehensive study and evaluation of existing single image dehazing algorithms, using a new large-scale benchmark consisting of both synthetic and real-world hazy images, called REalistic Single Image DEhazing (RESIDE).

Benchmarking Image Dehazing +1

Paper
Code

FastFusionNet: New State-of-the-Art for DAWNBench SQuAD

2 code implementations • 28 Feb 2019 • Felix Wu, Boyi Li, Lequn Wang, Ni Lao, John Blitzer, Kilian Q. Weinberger

In this technical report, we introduce FastFusionNet, an efficient variant of FusionNet [12].

Reading Comprehension Retrieval

Paper
Code

Positional Normalization

2 code implementations • NeurIPS 2019 • Boyi Li, Felix Wu, Kilian Q. Weinberger, Serge Belongie

A popular method to reduce the training time of deep neural networks is to normalize activations at each layer.

154

Paper
Code

Neural Network Out-of-Distribution Detection for Regression Tasks

no code implementations • 25 Sep 2019 • Geoff Pleiss, Amauri Souza, Joseph Kim, Boyi Li, Kilian Q. Weinberger

Neural network out-of-distribution (OOD) detection aims to identify when a model is unable to generalize to new inputs, either due to covariate shift or anomalous data.

Out-of-Distribution Detection Out of Distribution (OOD) Detection +1

Paper
Add Code

Integrated Triaging for Fast Reading Comprehension

no code implementations • 28 Sep 2019 • Felix Wu, Boyi Li, Lequn Wang, Ni Lao, John Blitzer, Kilian Q. Weinberger

This paper introduces Integrated Triaging, a framework that prunes almost all context in early layers of a network, leaving the remaining (deep) layers to scan only a tiny fraction of the full corpus.

Computational Efficiency Machine Reading Comprehension +1

Paper
Add Code

On Feature Normalization and Data Augmentation

1 code implementation • CVPR 2021 • Boyi Li, Felix Wu, Ser-Nam Lim, Serge Belongie, Kilian Q. Weinberger

The moments (a. k. a., mean and standard deviation) of latent features are often removed as noise when training image recognition models, to increase stability and reduce training time.

Ranked #32 on Domain Generalization on ImageNet-A

Data Augmentation Domain Generalization +2

142

Paper
Code

WiCV 2020: The Seventh Women In Computer Vision Workshop

no code implementations • 11 Jan 2021 • Hazel Doughty, Nour Karessli, Kathryn Leonard, Boyi Li, Carianne Martinez, Azadeh Mobasher, Arsha Nagrani, Srishti Yadav

It provides a voice to a minority (female) group in computer vision community and focuses on increasingly the visibility of these researchers, both in academia and industry.

Paper
Add Code

SITTA: Single Image Texture Translation for Data Augmentation

2 code implementations • 25 Jun 2021 • Boyi Li, Yin Cui, Tsung-Yi Lin, Serge Belongie

In this paper, we propose and explore the problem of image translation for data augmentation.

Data Augmentation Few-Shot Image Classification +2

Paper
Code

Fixed Neural Network Steganography: Train the images, not the network

1 code implementation • ICLR 2022 • Varsha Kishore, Xiangyu Chen, Yan Wang, Boyi Li, Kilian Q Weinberger

Recent attempts at image steganography make use of advances in deep learning to train an encoder-decoder network pair to hide and retrieve secret messages in images.

Image Steganography Steganalysis

Paper
Code

Language-driven Semantic Segmentation

1 code implementation • ICLR 2022 • Boyi Li, Kilian Q. Weinberger, Serge Belongie, Vladlen Koltun, René Ranftl

We present LSeg, a novel model for language-driven semantic image segmentation.

Ranked #1 on Few-Shot Semantic Segmentation on FSS-1000

Descriptive Few-Shot Semantic Segmentation +4

669

Paper
Code

WiCV 2021: The Eighth Women In Computer Vision Workshop

no code implementations • 11 Mar 2022 • Arushi Goel, Niveditha Kalavakonda, Nour Karessli, Tejaswi Kasarla, Kathryn Leonard, Boyi Li, Nermin Samet and, Ghada Zamzmi

In this paper, we present the details of Women in Computer Vision Workshop - WiCV 2021, organized alongside the virtual CVPR 2021.

Paper
Add Code

Re-evaluating the Need for Multimodal Signals in Unsupervised Grammar Induction

no code implementations • 20 Dec 2022 • Boyi Li, Rodolfo Corona, Karttikeya Mangalam, Catherine Chen, Daniel Flaherty, Serge Belongie, Kilian Q. Weinberger, Jitendra Malik, Trevor Darrell, Dan Klein

Are multimodal inputs necessary for grammar induction?

Constituency Parsing

Paper
Add Code

LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models

1 code implementation • 23 May 2023 • Long Lian, Boyi Li, Adam Yala, Trevor Darrell

Our method significantly outperforms the base diffusion model and several strong baselines in accurately generating images according to prompts that require various capabilities, doubling the generation accuracy across four tasks on average.

Common Sense Reasoning Language Modelling +2

352

Paper
Code

LLM-grounded Video Diffusion Models

no code implementations • 29 Sep 2023 • Long Lian, Baifeng Shi, Adam Yala, Trevor Darrell, Boyi Li

We show that LLMs are able to understand complex spatiotemporal dynamics from text alone and generate layouts that align closely with both the prompts and the object motion patterns typically observed in the real world.

Language Modelling Large Language Model +1

Paper
Add Code

Interactive Task Planning with Language Models

no code implementations • 16 Oct 2023 • Boyi Li, Philipp Wu, Pieter Abbeel, Jitendra Malik

An interactive robot framework accomplishes long-horizon task planning and can easily generalize to new goals or distinct tasks, even during execution.

Language Modelling Large Language Model +1

Paper
Add Code

EmerNeRF: Emergent Spatial-Temporal Scene Decomposition via Self-Supervision

1 code implementation • 3 Nov 2023 • Jiawei Yang, Boris Ivanovic, Or Litany, Xinshuo Weng, Seung Wook Kim, Boyi Li, Tong Che, Danfei Xu, Sanja Fidler, Marco Pavone, Yue Wang

We present EmerNeRF, a simple yet powerful approach for learning spatial-temporal representations of dynamic driving scenes.

Optical Flow Estimation Semantic Segmentation

479

Paper
Code

From Wrong To Right: A Recursive Approach Towards Vision-Language Explanation

no code implementations • 21 Nov 2023 • Jiaxin Ge, Sanjay Subramanian, Trevor Darrell, Boyi Li

Addressing the challenge of adapting pre-trained vision-language models for generating insightful explanations for visual reasoning tasks with limited annotations, we present ReVisE: a $\textbf{Re}$cursive $\textbf{Vis}$ual $\textbf{E}$xplanation algorithm.

Explanation Generation Visual Question Answering (VQA) +1

Paper
Add Code

Self-correcting LLM-controlled Diffusion Models

no code implementations • 27 Nov 2023 • Tsung-Han Wu, Long Lian, Joseph E. Gonzalez, Boyi Li, Trevor Darrell

Steered by an LLM controller, SLD turns text-to-image generation into an iterative closed-loop process, ensuring correctness in the resulting image.

Attribute Text-to-Image Generation

Paper
Add Code

Synthesizing Moving People with 3D Control

no code implementations • 19 Jan 2024 • Boyi Li, Jathushan Rajasegaran, Yossi Gandelsman, Alexei A. Efros, Jitendra Malik

This disentangled approach allows our method to generate a sequence of images that are faithful to the target motion in the 3D pose and, to the input image in terms of visual similarity.

Paper
Add Code

Driving Everywhere with Large Language Model Policy Adaptation

no code implementations • 8 Feb 2024 • Boyi Li, Yue Wang, Jiageng Mao, Boris Ivanovic, Sushant Veer, Karen Leung, Marco Pavone

Adapting driving behavior to new environments, customs, and laws is a long-standing problem in autonomous driving, precluding the widespread deployment of autonomous vehicles (AVs).

Autonomous Driving Language Modelling +2

Paper
Add Code

Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition

no code implementations • 21 Mar 2024 • Sihyun Yu, Weili Nie, De-An Huang, Boyi Li, Jinwoo Shin, Anima Anandkumar

To tackle this issue, we propose content-motion latent diffusion model (CMD), a novel efficient extension of pretrained image diffusion models for video generation.

Video Generation

Paper
Add Code

Crypto Inverse-Power Options and Fractional Stochastic Volatility

no code implementations • 24 Mar 2024 • Boyi Li, Weixuan Xia

Recent empirical evidence has highlighted the crucial role of jumps in both price and volatility within the cryptocurrency market.

Computational Efficiency

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.