Search Results for author: Chen Huang

Found 64 papers, 23 papers with code

Enhancing Code LLM Training with Programmer Attention

no code implementations19 Mar 2025 Yifan Zhang, Chen Huang, Zachary Karas, Dung Thuy Nguyen, Kevin Leach, Yu Huang

Human attention provides valuable yet underexploited signals for code LLM training, offering a perspective beyond purely machine-driven attention.

Code Summarization

Breaking the Stigma! Unobtrusively Probe Symptoms in Depression Disorder Diagnosis Dialogue

no code implementations25 Jan 2025 Jieming Cao, Chen Huang, Yanan Zhang, Ruibo Deng, Jincheng Zhang, Wenqiang Lei

Stigma has emerged as one of the major obstacles to effectively diagnosing depression, as it prevents users from open conversations about their struggles.

Diagnostic

Digital Twin Online Channel Modeling: Challenges,Principles, and Applications

no code implementations15 Jan 2025 Junling Li, Cheng-Xiang Wang, Chen Huang, Tianrun Qi, Tong Wu

Different from traditional offline channel modeling, digital twin online channel modeling can sense and accurately characterize dynamic wireless channels in real time, and can therefore greatly assist 6G network optimization.

How to Enable Effective Cooperation Between Humans and NLP Models: A Survey of Principles, Formalizations, and Beyond

no code implementations10 Jan 2025 Chen Huang, Yang Deng, Wenqiang Lei, Jiancheng Lv, Tat-Seng Chua, Jimmy Xiangji Huang

With the advancement of large language models (LLMs), intelligent models have evolved from mere tools to autonomous agents with their own goals and strategies for cooperating with humans.

GraphOTTER: Evolving LLM-based Graph Reasoning for Complex Table Question Answering

1 code implementation2 Dec 2024 Qianlong Li, Chen Huang, Shuai Li, Yuanxin Xiang, Deng Xiong, Wenqiang Lei

Despite considerable progress having been made in the LLM era, the reasoning processes of existing methods are often implicit, feeding the entire table into prompts, making it difficult to effectively filter out irrelevant information in the table.

Question Answering

Aggregate-and-Adapt Natural Language Prompts for Downstream Generalization of CLIP

no code implementations31 Oct 2024 Chen Huang, Skyler Seto, Samira Abnar, David Grangier, Navdeep Jaitly, Josh Susskind

Then we jointly train a prompt generator, optimized to produce a prompt embedding that stays close to the aggregated summary while minimizing task loss at the same time.

Image Captioning Visual Question Answering (VQA)

Cavia: Camera-controllable Multi-view Video Diffusion with View-Integrated Attention

no code implementations14 Oct 2024 Dejia Xu, Yifan Jiang, Chen Huang, Liangchen Song, Thorsten Gernoth, Liangliang Cao, Zhangyang Wang, Hao Tang

Recent studies have attempted to incorporate camera control into the generation process, but their results are often limited to simple trajectories or lack the ability to generate consistent videos from multiple distinct camera paths for the same scene.

Image to Video Generation

Text Clustering as Classification with LLMs

1 code implementation30 Sep 2024 Chen Huang, Guoxiu He

Second, after integrating similar labels generated by the LLM, we prompt the LLM to assign the most appropriate label to each sample in the dataset.

Classification Clustering +2

Beyond Persuasion: Towards Conversational Recommender System with Credible Explanations

1 code implementation22 Sep 2024 Peixin Qin, Chen Huang, Yang Deng, Wenqiang Lei, Tat-Seng Chua

With the aid of large language models, current conversational recommender system (CRS) has gaining strong abilities to persuade users to accept recommended items.

Explanation Generation Recommendation Systems

On the Limited Generalization Capability of the Implicit Reward Model Induced by Direct Preference Optimization

no code implementations5 Sep 2024 Yong Lin, Skyler Seto, Maartje ter Hoeve, Katherine Metcalf, Barry-John Theobald, Xuan Wang, Yizhe Zhang, Chen Huang, Tong Zhang

These findings highlight that DPORM has limited generalization ability and substantiates the integration of an explicit reward model in iterative DPO approaches.

How JEPA Avoids Noisy Features: The Implicit Bias of Deep Linear Self Distillation Networks

no code implementations3 Jul 2024 Etai Littwin, Omid Saremi, Madhu Advani, Vimal Thilak, Preetum Nakkiran, Chen Huang, Joshua Susskind

A recent successful approach that falls under the JEPA framework is self-distillation, where an online encoder is trained to predict the output of the target encoder, sometimes using a lightweight predictor network.

Decoder Self-Supervised Learning

Legend: Leveraging Representation Engineering to Annotate Safety Margin for Preference Datasets

1 code implementation12 Jun 2024 Duanyu Feng, Bowen Qin, Chen Huang, Youcheng Huang, Zheng Zhang, Wenqiang Lei

By leveraging this safety direction, Legend can then leverage the semantic distances of paired responses along this direction to annotate margins automatically.

ARAIDA: Analogical Reasoning-Augmented Interactive Data Annotation

no code implementations20 May 2024 Chen Huang, Yiping Jin, Ilija Ilievski, Wenqiang Lei, Jiancheng Lv

To address this issue, interactive data annotation utilizes an annotation model to provide suggestions for humans to approve or correct.

Selective Annotation via Data Allocation: These Data Should Be Triaged to Experts for Annotation Rather Than the Model

no code implementations20 May 2024 Chen Huang, Yang Deng, Wenqiang Lei, Jiancheng Lv, Ido Dagan

As such, informative or hard data is assigned to the expert for annotation, while easy data is handled by the model.

Co-Matching: Towards Human-Machine Collaborative Legal Case Matching

no code implementations16 May 2024 Chen Huang, Xinwei Yang, Yang Deng, Wenqiang Lei, Jiancheng Lv, Tat-Seng Chua

However, successful legal case matching requires the tacit knowledge of legal practitioners, which is difficult to verbalize and encode into machines.

Towards Analyzing and Understanding the Limitations of DPO: A Theoretical Perspective

no code implementations6 Apr 2024 Duanyu Feng, Bowen Qin, Chen Huang, Zheng Zhang, Wenqiang Lei

Direct Preference Optimization (DPO), which derives reward signals directly from pairwise preference data, has shown its effectiveness on aligning Large Language Models (LLMs) with human preferences.

Concept -- An Evaluation Protocol on Conversational Recommender Systems with System-centric and User-centric Factors

1 code implementation4 Apr 2024 Chen Huang, Peixin Qin, Yang Deng, Wenqiang Lei, Jiancheng Lv, Tat-Seng Chua

The conversational recommendation system (CRS) has been criticized regarding its user experience in real-world scenarios, despite recent significant progress achieved in academia.

Conversational Recommendation Recommendation Systems

Strength Lies in Differences! Improving Strategy Planning for Non-collaborative Dialogues via Diversified User Simulation

no code implementations11 Mar 2024 Tong Zhang, Chen Huang, Yang Deng, Hongru Liang, Jia Liu, Zujie Wen, Wenqiang Lei, Tat-Seng Chua

We investigate non-collaborative dialogue agents, which are expected to engage in strategic conversations with diverse users, for securing a mutual agreement that leans favorably towards the system's objectives.

User Simulation

Arellano-Bond LASSO Estimator for Dynamic Linear Panel Models

1 code implementation1 Feb 2024 Victor Chernozhukov, Iván Fernández-Val, Chen Huang, Weining Wang

We show that weak dependence along the panel's time series dimension naturally implies approximate sparsity of the most informative moment conditions, motivating the following approach to remove the bias: First, apply LASSO to the cross-section data at each time period to construct most informative (and cross-fitted) instruments, using lagged values of suitable covariates.

Time Series

Cross-Space Adaptive Filter: Integrating Graph Topology and Node Attributes for Alleviating the Over-smoothing Problem

1 code implementation26 Jan 2024 Chen Huang, Haoyang Li, Yifan Zhang, Wenqiang Lei, Jiancheng Lv

To this end, various methods have been proposed to create an adaptive filter by incorporating an extra filter (e. g., a high-pass filter) extracted from the graph topology.

Attribute Node Classification

DREditor: An Time-efficient Approach for Building a Domain-specific Dense Retrieval Model

1 code implementation23 Jan 2024 Chen Huang, Duanyu Feng, Wenqiang Lei, Jiancheng Lv

Motivated by this, we develop a time-efficient approach called DREditor to edit the matching rule of an off-the-shelf dense retrieval model to suit a specific domain.

Retrieval

Towards Equipping Transformer with the Ability of Systematic Compositionality

1 code implementation12 Dec 2023 Chen Huang, Peixin Qin, Wenqiang Lei, Jiancheng Lv

One of the key factors in language productivity and human cognition is the ability of systematic compositionality, which refers to understanding composed unseen examples of seen primitives.

LiDAR: Sensing Linear Probing Performance in Joint Embedding SSL Architectures

no code implementations7 Dec 2023 Vimal Thilak, Chen Huang, Omid Saremi, Laurent Dinh, Hanlin Goh, Preetum Nakkiran, Joshua M. Susskind, Etai Littwin

In this paper, we introduce LiDAR (Linear Discriminant Analysis Rank), a metric designed to measure the quality of representations within JE architectures.

Adaptivity and Modularity for Efficient Generalization Over Task Complexity

no code implementations13 Oct 2023 Samira Abnar, Omid Saremi, Laurent Dinh, Shantel Wilson, Miguel Angel Bautista, Chen Huang, Vimal Thilak, Etai Littwin, Jiatao Gu, Josh Susskind, Samy Bengio

We investigate how the use of a mechanism for adaptive and modular computation in transformers facilitates the learning of tasks that demand generalization over the number of sequential computation steps (i. e., the depth of the computation graph).

Retrieval

Cryo-Electron Ptychography: Applications and Potential in Biological Characterisation

no code implementations9 Sep 2023 Chen Huang, Judy S. Kim, Angus I. Kirkland

There is a clear need for developments in characterisation techniques that provide detailed information about structure-function relationships in biology.

3D Reconstruction Single Particle Analysis

DUET: 2D Structured and Approximately Equivariant Representations

1 code implementation28 Jun 2023 Xavier Suau, Federico Danieli, T. Anderson Keller, Arno Blaas, Chen Huang, Jason Ramapuram, Dan Busbridge, Luca Zappella

We propose 2D strUctured and EquivarianT representations (coined DUET), which are 2d representations organized in a matrix structure, and equivariant with respect to transformations acting on the input data.

Self-Supervised Learning Transfer Learning

Semi-Supervised and Long-Tailed Object Detection with CascadeMatch

no code implementations24 May 2023 Yuhang Zang, Kaiyang Zhou, Chen Huang, Chen Change Loy

This paper focuses on long-tailed object detection in the semi-supervised learning setting, which poses realistic challenges, but has rarely been studied in the literature.

Long-tailed Object Detection Object +3

MAST: Masked Augmentation Subspace Training for Generalizable Self-Supervised Priors

no code implementations7 Mar 2023 Chen Huang, Hanlin Goh, Jiatao Gu, Josh Susskind

We do so by Masked Augmentation Subspace Training (or MAST) to encode in the single feature space the priors from different data augmentations in a factorized way.

Instance Segmentation Self-Supervised Learning +1

Unified Vision and Language Prompt Learning

1 code implementation13 Oct 2022 Yuhang Zang, Wei Li, Kaiyang Zhou, Chen Huang, Chen Change Loy

Prompt tuning, a parameter- and data-efficient transfer learning paradigm that tunes only a small number of parameters in a model's input space, has become a trend in the vision community since the emergence of large vision-language models like CLIP.

Domain Generalization Few-Shot Learning +2

Pre-Training Representations of Binary Code Using Contrastive Learning

no code implementations11 Oct 2022 Yifan Zhang, Chen Huang, Yueke Zhang, Kevin Cao, Scott Thomas Andersen, Huajie Shao, Kevin Leach, Yu Huang

We further analyze the impact of human-written and synthetic comments on binary code comprehension tasks, revealing a significant performance disparity.

Code Summarization Computer Security +3

A Weighted Random Forest Based PositioningAlgorithm for 6G Indoor Communications

no code implementations22 Aug 2022 Yang Wu, Yinghua Wang, Jie Huang, Cheng-Xiang Wang, Chen Huang

Due to the indoor none-line-of-sight (NLoS) propagation and multi-access interference (MAI), it is a great challenge to achieve centimeter-level positioning accuracy in indoor scenarios.

Position Prediction as an Effective Pretraining Strategy

1 code implementation15 Jul 2022 Shuangfei Zhai, Navdeep Jaitly, Jason Ramapuram, Dan Busbridge, Tatiana Likhomanenko, Joseph Yitan Cheng, Walter Talbott, Chen Huang, Hanlin Goh, Joshua Susskind

This pretraining strategy which has been used in BERT models in NLP, Wav2Vec models in Speech and, recently, in MAE models in Vision, forces the model to learn about relationships between the content in different parts of the input using autoencoding related objectives.

Position Prediction +2

Efficient Representation Learning via Adaptive Context Pooling

no code implementations5 Jul 2022 Chen Huang, Walter Talbott, Navdeep Jaitly, Josh Susskind

Inspired by the success of ConvNets that are combined with pooling to capture long-range dependencies, we learn to pool neighboring features for each token before computing attention in a given attention layer.

Representation Learning

An Empirical Study of Language Model Integration for Transducer based Speech Recognition

no code implementations31 Mar 2022 Huahuan Zheng, Keyu An, Zhijian Ou, Chen Huang, Ke Ding, Guanglu Wan

Based on the DR method, we propose a low-order density ratio method (LODR) by replacing the estimation with a low-order weak language model.

Language Modeling Language Modelling +2

Open-Vocabulary DETR with Conditional Matching

4 code implementations22 Mar 2022 Yuhang Zang, Wei Li, Kaiyang Zhou, Chen Huang, Chen Change Loy

To this end, we propose a novel open-vocabulary detector based on DETR -- hence the name OV-DETR -- which, once trained, can detect any object given its class name or an exemplar image.

Language Modelling object-detection +2

Artificial intelligence enabled radio propagation for communications-Part I: Channel characterization and antenna-channel optimization

no code implementations24 Nov 2021 Chen Huang, Ruisi He, Bo Ai, Andreas F. Molisch, Buon Kiong Lau, Katsuyuki Haneda, Bo Liu, Cheng-Xiang Wang, Mi Yang, Claude Oestges, Zhangdui Zhong

To provide higher data rates, as well as better coverage, cost efficiency, security, adaptability, and scalability, the 5G and beyond 5G networks are developed with various artificial intelligence techniques.

Artificial intelligence enabled radio propagation for communications-Part II: Scenario identification and channel modeling

no code implementations24 Nov 2021 Chen Huang, Ruisi He, Bo Ai, Andreas F. Molisch, Buon Kiong Lau, Katsuyuki Haneda, Bo Liu, Cheng-Xiang Wang, Mi Yang, Claude Oestges, Zhangdui Zhong

This two-part paper investigates the application of artificial intelligence (AI) and in particular machine learning (ML) to the study of wireless propagation channels.

A Geometry-Based Stochastic Model for Truck Communication Channels in Freeway Scenarios

no code implementations20 Oct 2021 Chen Huang, Rui Wang, Cheng-Xiang Wang, Pan Tang, Andreas F. Molisch

We validate this model by contrasting the root-mean-square delay spread and the angular spreads of departure/arrival derived from the channel model with the outcomes directly derived from the measurements.

Collision Avoidance

A Dot Product Attention Free Transformer

no code implementations29 Sep 2021 Shuangfei Zhai, Walter Talbott, Nitish Srivastava, Chen Huang, Hanlin Goh, Ruixiang Zhang, Joshua M. Susskind

We introduce Dot Product Attention Free Transformer (DAFT), an efficient variant of Transformers \citep{transformer} that eliminates the query-key dot product in self attention.

Image Classification Language Modelling

An Attention Free Transformer

11 code implementations28 May 2021 Shuangfei Zhai, Walter Talbott, Nitish Srivastava, Chen Huang, Hanlin Goh, Ruixiang Zhang, Josh Susskind

We introduce Attention Free Transformer (AFT), an efficient variant of Transformers that eliminates the need for dot product self attention.

Position

Uniform Inference on High-dimensional Spatial Panel Networks

no code implementations16 May 2021 Victor Chernozhukov, Chen Huang, Weining Wang

We propose employing a debiased-regularized, high-dimensional generalized method of moments (GMM) framework to perform inference on large-scale spatial panel networks.

MetricOpt: Learning to Optimize Black-Box Evaluation Metrics

no code implementations CVPR 2021 Chen Huang, Shuangfei Zhai, Pengsheng Guo, Josh Susskind

This leads to consistent improvements since the value function provides effective metric supervision during finetuning, and helps to correct the potential bias of loss-only supervision.

Image Classification Image Retrieval +3

FASA: Feature Augmentation and Sampling Adaptation for Long-Tailed Instance Segmentation

1 code implementation ICCV 2021 Yuhang Zang, Chen Huang, Chen Change Loy

We propose a simple yet effective method, Feature Augmentation and Sampling Adaptation (FASA), that addresses the data scarcity issue by augmenting the feature space especially for rare classes.

Instance Segmentation Segmentation +2

Addressing the Loss-Metric Mismatch with Adaptive Loss Alignment

no code implementations15 May 2019 Chen Huang, Shuangfei Zhai, Walter Talbott, Miguel Angel Bautista, Shih-Yu Sun, Carlos Guestrin, Josh Susskind

In most machine learning training paradigms a fixed, often handcrafted, loss function is assumed to be a good proxy for an underlying evaluation metric.

General Classification Meta-Learning +3

Dense Intrinsic Appearance Flow for Human Pose Transfer

1 code implementation CVPR 2019 Yining Li, Chen Huang, Chen Change Loy

Unlike existing methods, we propose to estimate dense and intrinsic 3D appearance flow to better guide the transfer of pixels between poses.

Pose Transfer

Pose Guided Human Video Generation

no code implementations ECCV 2018 Ceyuan Yang, Zhe Wang, Xinge Zhu, Chen Huang, Jianping Shi, Dahua Lin

Human pose, on the other hand, can represent motion patterns intrinsically and interpretably, and impose the geometric constraints regardless of appearance.

Generative Adversarial Network motion prediction +1

Deep Imbalanced Learning for Face Recognition and Attribute Prediction

1 code implementation1 Jun 2018 Chen Huang, Yining Li, Chen Change Loy, Xiaoou Tang

Data for face analysis often exhibit highly-skewed class distribution, i. e., most data belong to a few majority classes, while the minority classes only contain a scarce amount of instances.

Attribute Face Recognition +2

Mask-aware Photorealistic Face Attribute Manipulation

no code implementations24 Apr 2018 Ruoqi Sun, Chen Huang, Jianping Shi, Lizhuang Ma

The task of face attribute manipulation has found increasing applications, but still remains challeng- ing with the requirement of editing the attributes of a face image while preserving its unique details.

Attribute Face Recognition +1

CNNs are Globally Optimal Given Multi-Layer Support

no code implementations7 Dec 2017 Chen Huang, Chen Kong, Simon Lucey

Stochastic Gradient Descent (SGD) is the central workhorse for training modern CNNs.

Learning to Disambiguate by Asking Discriminative Questions

no code implementations ICCV 2017 Yining Li, Chen Huang, Xiaoou Tang, Chen-Change Loy

In particular, each tuple consists of a pair of images and 4. 6 discriminative questions (as positive samples) and 5. 9 non-discriminative questions (as negative samples) on average.

Benchmarking Image Captioning +4

Learning Policies for Adaptive Tracking with Deep Feature Cascades

no code implementations ICCV 2017 Chen Huang, Simon Lucey, Deva Ramanan

Our fundamental insight is to take an adaptive approach, where easy frames are processed with cheap features (such as pixel values), while challenging frames are processed with invariant but expensive deep features.

Decision Making Reinforcement Learning +1

Need for Speed: A Benchmark for Higher Frame Rate Object Tracking

1 code implementation ICCV 2017 Hamed Kiani Galoogahi, Ashton Fagg, Chen Huang, Deva Ramanan, Simon Lucey

In this paper, we propose the first higher frame rate video dataset (called Need for Speed - NfS) and benchmark for visual object tracking.

Visual Object Tracking

Local Similarity-Aware Deep Feature Embedding

no code implementations NeurIPS 2016 Chen Huang, Chen Change Loy, Xiaoou Tang

Existing deep embedding methods in vision tasks are capable of learning a compact Euclidean space from images, where Euclidean distances correspond to a similarity metric.

Image Retrieval Retrieval +2

Spatially Supervised Recurrent Convolutional Neural Networks for Visual Object Tracking

2 code implementations19 Jul 2016 Guanghan Ning, Zhi Zhang, Chen Huang, Zhihai He, Xiaobo Ren, Haohong Wang

In this paper, we develop a new approach of spatially supervised recurrent convolutional neural networks for visual object tracking.

Binary Classification object-detection +3

Learning Deep Representation for Imbalanced Classification

no code implementations CVPR 2016 Chen Huang, Yining Li, Chen Change Loy, Xiaoou Tang

We further demonstrate that more discriminative deep representation can be learned by enforcing a deep network to maintain both inter-cluster and inter-class margins.

Classification General Classification +2

Discriminative Sparse Neighbor Approximation for Imbalanced Learning

no code implementations3 Feb 2016 Chen Huang, Chen Change Loy, Xiaoou Tang

These methods further deteriorate on small, imbalanced data that has a large degree of class overlap.

Prediction

Cannot find the paper you are looking for? You can Submit a new open access paper.