Search Results for author: Menglin Jia

Found 14 papers, 12 papers with code

MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding

1 code implementation • 8 Apr 2024 • Bo He, Hengduo Li, Young Kyun Jang, Menglin Jia, Xuefei Cao, Ashish Shah, Abhinav Shrivastava, Ser-Nam Lim

However, existing LLM-based large multimodal models (e. g., Video-LLaMA, VideoChat) can only take in a limited number of frames for short video understanding.

Ranked #1 on Video Classification on COIN

Question Answering Video Captioning +4

109

Paper
Code

VideoGLUE: Video General Understanding Evaluation of Foundation Models

1 code implementation • 6 Jul 2023 • Liangzhe Yuan, Nitesh Bharadwaj Gundavarapu, Long Zhao, Hao Zhou, Yin Cui, Lu Jiang, Xuan Yang, Menglin Jia, Tobias Weyand, Luke Friedman, Mikhail Sirotenko, Huisheng Wang, Florian Schroff, Hartwig Adam, Ming-Hsuan Yang, Ting Liu, Boqing Gong

We evaluate existing foundation models video understanding capabilities using a carefully designed experiment protocol consisting of three hallmark tasks (action recognition, temporal localization, and spatiotemporal localization), eight datasets well received by the community, and four adaptation methods tailoring a foundation model (FM) for a downstream task.

Action Recognition Temporal Localization +1

76,588

Paper
Code

Emergent Correspondence from Image Diffusion

1 code implementation • NeurIPS 2023 • Luming Tang, Menglin Jia, Qianqian Wang, Cheng Perng Phoo, Bharath Hariharan

We propose a simple strategy to extract this implicit knowledge out of diffusion networks as image features, namely DIffusion FeaTures (DIFT), and use them to establish correspondences between real images.

Semantic correspondence

476

Paper
Code

PromptFusion: Decoupling Stability and Plasticity for Continual Learning

no code implementations • 13 Mar 2023 • Haoran Chen, Zuxuan Wu, Xintong Han, Menglin Jia, Yu-Gang Jiang

Such a trade-off is referred to as the stabilityplasticity dilemma and is a more general and challenging problem for continual learning.

Class Incremental Learning Incremental Learning

Paper
Add Code

Searching for Structure in Unfalsifiable Claims

1 code implementation • 19 Aug 2022 • Peter Ebert Christensen, Frederik Warburg, Menglin Jia, Serge Belongie

In this work, we aim to distill such posts into a small set of narratives that capture the essential claims related to a given topic.

Fact Checking Topic Models

Paper
Code

Visual Prompt Tuning

6 code implementations • 23 Mar 2022 • Menglin Jia, Luming Tang, Bor-Chun Chen, Claire Cardie, Serge Belongie, Bharath Hariharan, Ser-Nam Lim

The current modus operandi in adapting pre-trained models involves updating all the backbone parameters, ie, full fine-tuning.

Ranked #2 on Prompt Engineering on ImageNet-21k

Image Classification Long-tail Learning +2

908

Paper
Code

Rethinking Nearest Neighbors for Visual Classification

1 code implementation • 15 Dec 2021 • Menglin Jia, Bor-Chun Chen, Zuxuan Wu, Claire Cardie, Serge Belongie, Ser-Nam Lim

In this paper, we investigate $k$-Nearest-Neighbor (k-NN) classifiers, a classical model-free learning method from the pre-deep learning era, as an augmentation to modern neural network based approaches.

Classification

Paper
Code

When in Doubt: Improving Classification Performance with Alternating Normalization

1 code implementation • Findings (EMNLP) 2021 • Menglin Jia, Austin Reiter, Ser-Nam Lim, Yoav Artzi, Claire Cardie

We introduce Classification with Alternating Normalization (CAN), a non-parametric post-processing step for classification.

Classification

Paper
Code

Exploring Visual Engagement Signals for Representation Learning

1 code implementation • ICCV 2021 • Menglin Jia, Zuxuan Wu, Austin Reiter, Claire Cardie, Serge Belongie, Ser-Nam Lim

Visual engagement in social media platforms comprises interactions with photo posts including comments, shares, and likes.

Bias Detection Emotion Recognition +2

Paper
Code

Intentonomy: a Dataset and Study towards Human Intent Understanding

1 code implementation • CVPR 2021 • Menglin Jia, Zuxuan Wu, Austin Reiter, Claire Cardie, Serge Belongie, Ser-Nam Lim

Based on our findings, we conduct further study to quantify the effect of attending to object and context classes as well as textual information in the form of hashtags when training an intent classifier.

Paper
Code

Fashionpedia: Ontology, Segmentation, and an Attribute Localization Dataset

5 code implementations • ECCV 2020 • Menglin Jia, Mengyun Shi, Mikhail Sirotenko, Yin Cui, Claire Cardie, Bharath Hariharan, Hartwig Adam, Serge Belongie

In this work we explore the task of instance segmentation with attribute localization, which unifies instance segmentation (detect and segment each object instance) and fine-grained visual attribute categorization (recognize one or multiple attributes).

Attribute Fine-Grained Visual Categorization +5

5,176

Paper
Code

Deep Multi-Modal Sets

no code implementations • 3 Mar 2020 • Austin Reiter, Menglin Jia, Pu Yang, Ser-Nam Lim

Most deep learning-based methods rely on a late fusion technique whereby multiple feature types are encoded and concatenated and then a multi layer perceptron (MLP) combines the fused embedding to make predictions.

Paper
Add Code

Class-Balanced Loss Based on Effective Number of Samples

8 code implementations • CVPR 2019 • Yin Cui, Menglin Jia, Tsung-Yi Lin, Yang song, Serge Belongie

We design a re-weighting scheme that uses the effective number of samples for each class to re-balance the loss, thereby yielding a class-balanced loss.

Ranked #2 on Long-tail Learning on EGTEA

Image Classification Long-tail Learning

768

Paper
Code

A Deep-Learning-Based Fashion Attributes Detection Model

1 code implementation • 24 Oct 2018 • Menglin Jia, Yichen Zhou, Mengyun Shi, Bharath Hariharan

Such information analyzing process is called abstracting, which recognize similarities or differences across all the garments and collections.

Marketing

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.