1 code implementation • 2 Oct 2024 • Aleksei Bochkovskii, Amaël Delaunoy, Hugo Germain, Marcel Santos, Yichao Zhou, Stephan R. Richter, Vladlen Koltun
We present a foundation model for zero-shot metric monocular depth estimation.
no code implementations • 21 Jul 2024 • EunJeong Hwang, Yichao Zhou, James Bradley Wendt, Beliz Gunel, Nguyen Vo, Jing Xie, Sandeep Tata
Large language models (LLMs) often struggle with processing extensive input contexts, which can lead to redundant, inaccurate, or incoherent summaries.
no code implementations • 7 Jun 2024 • EunJeong Hwang, Yichao Zhou, Beliz Gunel, James Bradley Wendt, Sandeep Tata
No existing dataset adequately tests how well language models can incrementally update entity summaries - a crucial ability as these models rapidly advance.
no code implementations • 25 Mar 2024 • Beliz Gunel, James B. Wendt, Jing Xie, Yichao Zhou, Nguyen Vo, Zachary Fisher, Sandeep Tata
Users often struggle with decision-making between two options (A vs B), as it usually requires time-consuming research across multiple web pages.
no code implementations • 15 Dec 2023 • Mengmeng Sheng, Zeren Sun, Zhenhuang Cai, Tao Chen, Yichao Zhou, Yazhou Yao
There has been significant attention devoted to the effectiveness of various domains, such as semi-supervised learning, contrastive learning, and meta-learning, in enhancing the performance of methods for noisy label learning (NLL) tasks.
no code implementations • 5 Dec 2023 • Yuxuan Yan, Chi Zhang, Rui Wang, Yichao Zhou, Gege Zhang, Pei Cheng, Gang Yu, Bin Fu
This study investigates identity-preserving image synthesis, an intriguing task in image generation that seeks to maintain a subject's identity while adding a personalized, stylistic touch.
no code implementations • 20 Dec 2022 • Jing Xie, James B. Wendt, Yichao Zhou, Seth Ebner, Sandeep Tata
Many business workflows require extracting important fields from form-like documents (e. g. bank statements, bills of lading, purchase orders, etc.).
no code implementations • 15 Nov 2022 • Zilong Wang, Yichao Zhou, Wei Wei, Chen-Yu Lee, Sandeep Tata
Understanding visually-rich business documents to extract structured data and automate business workflows has been receiving attention both in academia and industry.
no code implementations • 28 Oct 2022 • Yichao Zhou, James B. Wendt, Navneet Potti, Jing Xie, Sandeep Tata
A key bottleneck in building automatic extraction models for visually rich documents like invoices is the cost of acquiring the several thousand high-quality labeled documents that are needed to train a model with acceptable accuracy.
no code implementations • 8 Aug 2021 • Yichao Zhou, Jyun-Yu Jiang, Xiusi Chen, Wei Wang
COVID-19 has caused lasting damage to almost every domain in public health, society, and economy.
no code implementations • 23 Jun 2021 • Yichao Zhou, Chelsea Ju, J. Harry Caufield, Kevin Shih, Calvin Chen, Yizhou Sun, Kai-Wei Chang, Peipei Ping, Wei Wang
To facilitate various downstream applications using clinical case reports (CCRs), we pre-train two deep contextualized language models, Clinical Embeddings from Language Model (C-ELMo) and Clinical Contextual String Embeddings (C-Flair) using the clinical-related corpus from the PubMed Central.
2 code implementations • CVPR 2021 • Yichao Zhou, Shichen Liu, Yi Ma
Recent advances have shown that symmetry, a structural prior that most objects exhibit, can support a variety of single-view 3D understanding tasks.
no code implementations • 28 Feb 2021 • Yichao Zhou, Wei-Ting Chen, BoWen Zhang, David Lee, J. Harry Caufield, Kai-Wei Chang, Yizhou Sun, Peipei Ping, Wei Wang
Clinical case reports are written descriptions of the unique aspects of a particular clinical case, playing an essential role in sharing clinical experiences about atypical disease phenotypes and new therapies.
2 code implementations • 7 Jan 2021 • Yichao Zhou, Ying Sheng, Nguyen Vo, Nick Edmonds, Sandeep Tata
There has been a steady need to precisely extract structured knowledge from the web (i. e. HTML documents).
no code implementations • ICCV 2021 • Shichen Liu, Yichao Zhou, Yajie Zhao
Being able to infer 3D structures from 2D images with geometric principles, vanishing points have been a well-recognized concept in 3D vision research.
2 code implementations • 16 Dec 2020 • Yichao Zhou, Yu Yan, Rujun Han, J. Harry Caufield, Kai-Wei Chang, Yizhou Sun, Peipei Ping, Wei Wang
There has been a steady need in the medical community to precisely extract the temporal relations between clinical events.
no code implementations • NeurIPS 2020 • Chaobing Song, Zhengyuan Zhou, Yichao Zhou, Yong Jiang, Yi Ma
The optimization problems associated with training generative adversarial neural networks can be largely reduced to certain {\em non-monotone} variational inequality problems (VIPs), whereas existing convergence results are mostly based on monotone or strongly monotone assumptions.
1 code implementation • EMNLP 2020 • Rujun Han, Yichao Zhou, Nanyun Peng
Extracting event temporal relations is a critical task for information extraction and plays an important role in natural language understanding.
no code implementations • 17 Aug 2020 • Shaunak Mishra, Manisha Verma, Yichao Zhou, Kapil Thadani, Wei Wang
Since major ad platforms typically run A/B tests for multiple advertisers in parallel, we explore the possibility of collaboratively learning ad creative refinement via A/B tests of multiple advertisers.
1 code implementation • 7 Aug 2020 • Yichao Zhou, Jingwei Huang, Xili Dai, Shichen Liu, Linjie Luo, Zhili Chen, Yi Ma
We present HoliCity, a city-scale 3D dataset with rich structural information.
no code implementations • ACL 2020 • Yichao Zhou, Jyun-Yu Jiang, Jieyu Zhao, Kai-Wei Chang, Wei Wang
In this paper, we propose Pronunciation-attentive Contextualized Pun Recognition (PCPR) to perceive human humor, detect if a sentence contains puns and locate them in the sentence.
2 code implementations • 17 Jun 2020 • Yichao Zhou, Shichen Liu, Yi Ma
In this work, we focus on object-level 3D reconstruction and present a geometry-based end-to-end deep learning framework that first detects the mirror plane of reflection symmetry that commonly exists in man-made objects and then predicts depth maps by finding the intra-image pixel-wise correspondence of the symmetry.
1 code implementation • 29 Apr 2020 • Yichao Zhou, Jyun-Yu Jiang, Jieyu Zhao, Kai-Wei Chang, Wei Wang
In this paper, we propose Pronunciation-attentive Contextualized Pun Recognition (PCPR) to perceive human humor, detect if a sentence contains puns and locate them in the sentence.
1 code implementation • 20 Jan 2020 • Yichao Zhou, Shaunak Mishra, Manisha Verma, Narayan Bhamidipati, Wei Wang
There is a perennial need in the online advertising industry to refresh ad creatives, i. e., images and text used for enticing online users towards a brand.
1 code implementation • NeurIPS 2019 • Yichao Zhou, Haozhi Qi, Jingwei Huang, Yi Ma
We present a simple yet effective end-to-end trainable deep network with geometry-inspired convolutional operators for detecting vanishing points in images.
1 code implementation • IJCNLP 2019 • Yichao Zhou, Jyun-Yu Jiang, Kai-Wei Chang, Wei Wang
To identify adversarial attacks, a perturbation discriminator validates how likely a token in the text is perturbed and provides a set of potential perturbations.
2 code implementations • ICCV 2019 • Yichao Zhou, Haozhi Qi, Yuexiang Zhai, Qi Sun, Zhili Chen, Li-Yi Wei, Yi Ma
In this paper, we propose a method to obtain a compact and accurate 3D wireframe representation from a single image by effectively exploiting global structural regularities.
1 code implementation • ICCV 2019 • Yichao Zhou, Haozhi Qi, Yi Ma
We conduct extensive experiments and show that our method significantly outperforms the previous state-of-the-art wireframe and line extraction algorithms.
Ranked #5 on
Line Segment Detection
on wireframe dataset
1 code implementation • ICCV 2019 • Jingwei Huang, Yichao Zhou, Thomas Funkhouser, Leonidas Guibas
In this work, we introduce the novel problem of identifying dense canonical 3D coordinate frames from a single RGB image.
no code implementations • 18 Dec 2018 • Yichao Zhou, Wei Chu, Sam Young, Xin Chen
In the learning stage, a sequence of stylistically uniform, multiple-channel music samples was modeled by a RNN.
1 code implementation • EMNLP 2018 • Jieyu Zhao, Yichao Zhou, Zeyu Li, Wei Wang, Kai-Wei Chang
Word embedding models have become a fundamental component in a wide range of Natural Language Processing (NLP) applications.
no code implementations • 8 Dec 2014 • Yichao Zhou, Yuexin Wu, Jianyang Zeng
The computation of the global minimum energy conformation (GMEC) is an important and challenging topic in structure-based computational protein design.