Search Results for author: Zhen Zeng

Found 25 papers, 5 papers with code

AdaptAgent: Adapting Multimodal Web Agents with Few-Shot Learning from Human Demonstrations

no code implementations20 Nov 2024 Gaurav Verma, Rachneet Kaur, Nishan Srishankar, Zhen Zeng, Tucker Balch, Manuela Veloso

Our experiments on two popular benchmarks -- Mind2Web & VisualWebArena -- show that using in-context demonstrations (for proprietary models) or meta-adaptation demonstrations (for meta-learned open-weights models) boosts task success rate by 3. 36% to 7. 21% over non-adapted state-of-the-art models, corresponding to a relative increase of 21. 03% to 65. 75%.

Few-Shot Learning

Visual-Oriented Fine-Grained Knowledge Editing for MultiModal Large Language Models

no code implementations19 Nov 2024 Zhen Zeng, Leijiang Gu, Xun Yang, Zhangling Duan, Zenglin Shi, Meng Wang

MSCKE leverages a multimodal scope classifier that integrates both visual and textual information to accurately identify and update knowledge related to specific entities within images.

knowledge editing

TRIP-PAL: Travel Planning with Guarantees by Combining Large Language Models and Automated Planners

no code implementations14 Jun 2024 Tomas De la Rosa, Sriram Gopalakrishnan, Alberto Pozanco, Zhen Zeng, Daniel Borrajo

Travel planning is a complex task that involves generating a sequence of actions related to visiting places subject to constraints and maximizing some user satisfaction criteria.

Language Modelling Large Language Model +1

Evaluating Large Language Models on Time Series Feature Understanding: A Comprehensive Taxonomy and Benchmark

no code implementations25 Apr 2024 Elizabeth Fons, Rachneet Kaur, Soham Palande, Zhen Zeng, Tucker Balch, Manuela Veloso, Svitlana Vyetrenko

Large Language Models (LLMs) offer the potential for automatic time series analysis and reporting, which is a critical task across many domains, spanning healthcare, finance, climate, energy, and many more.

Time Series Time Series Analysis

FlowMind: Automatic Workflow Generation with LLMs

no code implementations17 Mar 2024 Zhen Zeng, William Watson, Nicole Cho, Saba Rahimi, Shayleen Reynolds, Tucker Balch, Manuela Veloso

FlowMind further simplifies user interaction by presenting high-level descriptions of auto-generated workflows, enabling users to inspect and provide feedback effectively.

Benchmarking Question Answering

Financial Time Series Forecasting using CNN and Transformer

no code implementations11 Apr 2023 Zhen Zeng, Rachneet Kaur, Suchetha Siddagangappa, Saba Rahimi, Tucker Balch, Manuela Veloso

In our experiments, we demonstrated the success of the proposed method in comparison to commonly adopted statistical and deep learning methods on forecasting intraday stock price change of S&P 500 constituents.

Decision Making Time Series +1

Self-supervised Graph Learning for Long-tailed Cognitive Diagnosis

no code implementations15 Oct 2022 Shanshan Wang, Zhen Zeng, Xun Yang, Xingyi Zhang

Cognitive diagnosis is a fundamental yet critical research task in the field of intelligent education, which aims to discover the proficiency level of different students on specific knowledge concepts.

cognitive diagnosis Graph Learning

TGAVC: Improving Autoencoder Voice Conversion with Text-Guided and Adversarial Training

no code implementations8 Aug 2022 Huaizhen Tang, xulong Zhang, Jianzong Wang, Ning Cheng, Zhen Zeng, Edward Xiao, Jing Xiao

In this paper, a novel voice conversion framework, named $\boldsymbol T$ext $\boldsymbol G$uided $\boldsymbol A$utoVC(TGAVC), is proposed to more effectively separate content and timbre from speech, where an expected content embedding produced based on the text transcriptions is designed to guide the extraction of voice content.

Voice Conversion

SeanNet: Semantic Understanding Network for Localization Under Object Dynamics

1 code implementation5 Oct 2021 Xiao Li, Yidong Du, Zhen Zeng, Odest Chadwicke Jenkins

This paper proposes a SEmantic understANding Network (SeanNet) architecture that enables an effective learning process with coupled visual and semantic inputs.

Contrastive Learning Object +1

Deep Video Prediction for Time Series Forecasting

no code implementations24 Feb 2021 Zhen Zeng, Tucker Balch, Manuela Veloso

In this paper, we propose to approach economic time series forecasting of multiple financial assets in a novel way via video prediction.

Decision Making Time Series +2

MelGlow: Efficient Waveform Generative Network Based on Location-Variable Convolution

3 code implementations3 Dec 2020 Zhen Zeng, Jianzong Wang, Ning Cheng, Jing Xiao

In this paper, an efficient network, named location-variable convolution, is proposed to model the dependencies of waveforms.

Visual Time Series Forecasting: An Image-driven Approach

no code implementations18 Nov 2020 Srijan Sood, Zhen Zeng, Naftali Cohen, Tucker Balch, Manuela Veloso

In this work, we leverage advances in deep learning to extend the field of time series forecasting to a visual setting.

Quantization Time Series +1

Prosody Learning Mechanism for Speech Synthesis System Without Text Length Limit

no code implementations13 Aug 2020 Zhen Zeng, Jianzong Wang, Ning Cheng, Jing Xiao

Recent neural speech synthesis systems have gradually focused on the control of prosody to improve the quality of synthesized speech, but they rarely consider the variability of prosody and the correlation between prosody and semantics together.

Language Modelling Position +2

Semantic Linking Maps for Active Visual Object Search

no code implementations18 Jun 2020 Zhen Zeng, Adrian Röfer, Odest Chadwicke Jenkins

SLiM simultaneously maintains the belief over a target object's location as well as landmark objects' locations, while accounting for probabilistic inter-object spatial relations.

Object

AlignTTS: Efficient Feed-Forward Text-to-Speech System without Explicit Alignment

2 code implementations4 Mar 2020 Zhen Zeng, Jianzong Wang, Ning Cheng, Tian Xia, Jing Xiao

Targeting at both high efficiency and performance, we propose AlignTTS to predict the mel-spectrum in parallel.

Text to Speech

GraphTTS: graph-to-sequence modelling in neural text-to-speech

no code implementations4 Mar 2020 Aolan Sun, Jianzong Wang, Ning Cheng, Huayi Peng, Zhen Zeng, Jing Xiao

This paper leverages the graph-to-sequence method in neural text-to-speech (GraphTTS), which maps the graph embedding of the input sequence to spectrograms.

Graph Embedding Graph-to-Sequence +2

A Random Interaction Forest for Prioritizing Predictive Biomarkers

no code implementations4 Oct 2019 Zhen Zeng, Yuefeng Lu, Judong Shen, Wei Zheng, Peter Shaw, Mary Beth Dorr

Precision medicine is becoming a focus in medical research recently, as its implementation brings values to all stakeholders in the healthcare system.

Semantic Mapping with Simultaneous Object Detection and Localization

1 code implementation26 Oct 2018 Zhen Zeng, Yunwen Zhou, Odest Chadwicke Jenkins, Karthik Desingh

Our results demonstrate that the particle filtering based inference of CT-Map provides improved object detection and pose estimation with respect to baseline methods that treat observations as independent samples of a scene.

Robotics

Cannot find the paper you are looking for? You can Submit a new open access paper.