Search Results for author: Mingchen Zhuge

Found 17 papers, 12 papers with code

AFlow: Automating Agentic Workflow Generation

1 code implementation14 Oct 2024 Jiayi Zhang, Jinyu Xiang, Zhaoyang Yu, Fengwei Teng, Xionghui Chen, Jiaqi Chen, Mingchen Zhuge, Xin Cheng, Sirui Hong, Jinlin Wang, Bingnan Zheng, Bang Liu, Yuyu Luo, Chenglin Wu

Large language models (LLMs) have demonstrated remarkable potential in solving complex tasks across diverse domains, typically by employing agentic workflows that follow detailed instructions and operational sequences.

Code Generation

OpenHands: An Open Platform for AI Software Developers as Generalist Agents

2 code implementations23 Jul 2024 Xingyao Wang, Boxuan Li, Yufan Song, Frank F. Xu, Xiangru Tang, Mingchen Zhuge, Jiayi Pan, Yueqi Song, Bowen Li, Jaskirat Singh, Hoang H. Tran, Fuqiang Li, Ren Ma, Mingzhang Zheng, Bill Qian, Yanjun Shao, Niklas Muennighoff, Yizhe Zhang, Binyuan Hui, Junyang Lin, Robert Brennan, Hao Peng, Heng Ji, Graham Neubig

OpenDevin), a platform for the development of powerful and flexible AI agents that interact with the world in similar ways to those of a human developer: by writing code, interacting with a command line, and browsing the web.

Goldfish: Vision-Language Understanding of Arbitrarily Long Videos

1 code implementation17 Jul 2024 Kirolos Ataallah, Xiaoqian Shen, Eslam Abdelrahman, Essam Sleiman, Mingchen Zhuge, Jian Ding, Deyao Zhu, Jürgen Schmidhuber, Mohamed Elhoseiny

This design of the retrieval mechanism enables the Goldfish to efficiently process arbitrarily long video sequences, facilitating its application in contexts such as movies or television series.

Retrieval Video Understanding

Language Agents as Optimizable Graphs

2 code implementations26 Feb 2024 Mingchen Zhuge, Wenyi Wang, Louis Kirsch, Francesco Faccio, Dmitrii Khizbullin, Jürgen Schmidhuber

Various human-designed prompt engineering techniques have been proposed to improve problem solvers based on Large Language Models (LLMs), yielding many disparate code bases.

Prompt Engineering

Learning to Identify Critical States for Reinforcement Learning from Videos

1 code implementation ICCV 2023 Haozhe Liu, Mingchen Zhuge, Bing Li, Yuhui Wang, Francesco Faccio, Bernard Ghanem, Jürgen Schmidhuber

Recent work on deep reinforcement learning (DRL) has pointed out that algorithmic information about good policies can be extracted from offline data which lack explicit information about executed actions.

Deep Reinforcement Learning reinforcement-learning

QR-CLIP: Introducing Explicit Open-World Knowledge for Location and Time Reasoning

no code implementations2 Feb 2023 Weimin Shi, Mingchen Zhuge, Dehong Gao, Zhong Zhou, Ming-Ming Cheng, Deng-Ping Fan

Daily images may convey abstract meanings that require us to memorize and infer profound information from them.

World Knowledge

Skating-Mixer: Long-Term Sport Audio-Visual Modeling with MLPs

1 code implementation8 Mar 2022 Jingfei Xia, Mingchen Zhuge, Tiantian Geng, Shun Fan, Yuantai Wei, Zhenyu He, Feng Zheng

Figure skating scoring is challenging because it requires judging the technical moves of the players as well as their coordination with the background music.

Diversity Representation Learning

Fast Camouflaged Object Detection via Edge-based Reversible Re-calibration Network

1 code implementation5 Nov 2021 Ge-Peng Ji, Lei Zhu, Mingchen Zhuge, Keren Fu

Camouflaged Object Detection (COD) aims to detect objects with similar patterns (e. g., texture, intensity, colour, etc) to their surroundings, and recently has attracted growing research interest.

Camouflaged Object Segmentation Image Segmentation +3

Kaleido-BERT: Vision-Language Pre-training on Fashion Domain

1 code implementation CVPR 2021 Mingchen Zhuge, Dehong Gao, Deng-Ping Fan, Linbo Jin, Ben Chen, Haoming Zhou, Minghui Qiu, Ling Shao

We present a new vision-language (VL) pre-training model dubbed Kaleido-BERT, which introduces a novel kaleido strategy for fashion cross-modality representations from transformers.

Image Retrieval Text Retrieval

Towards Accurate Camouflaged Object Detection with Mixture Convolution and Interactive Fusion

no code implementations14 Jan 2021 Geng Chen, Xinrui Chen, Bo Dong, Mingchen Zhuge, Yongxiong Wang, Hongbo Bi, Jian Chen, Peng Wang, Yanning Zhang

Our method detects camouflaged objects with an effective fusion strategy, which aggregates the rich context information from a large receptive field.

object-detection Object Detection

Cannot find the paper you are looking for? You can Submit a new open access paper.