Search Results for author: Lin Sun

Found 32 papers, 9 papers with code

UMIE: Unified Multimodal Information Extraction with Instruction Tuning

1 code implementation5 Jan 2024 Lin Sun, Kai Zhang, Qingyuan Li, Renze Lou

Multimodal information extraction (MIE) gains significant attention as the popularity of multimedia content increases.

BiSwift: Bandwidth Orchestrator for Multi-Stream Video Analytics on Edge

no code implementations25 Dec 2023 Lin Sun, Weijun Wang, Tingting Yuan, Liang Mi, Haipeng Dai, Yunxin Liu, XiaoMing Fu

To achieve this goal, we propose BiSwift, a bi-level framework that scales the concurrent real-time video analytics by a novel adaptive hybrid codec integrated with multi-level pipelines, and a global bandwidth controller for multiple video streams.

Fairness Management +3

PathAsst: A Generative Foundation AI Assistant Towards Artificial General Intelligence of Pathology

1 code implementation24 May 2023 Yuxuan Sun, Chenglu Zhu, Sunyi Zheng, Kai Zhang, Lin Sun, Zhongyi Shui, Yunlong Zhang, Honglin Li, Lin Yang

Secondly, by leveraging the collected data, we construct PathCLIP, a pathology-dedicated CLIP, to enhance PathAsst's capabilities in interpreting pathology images.

Instruction Following Language Modelling +1

Indeterminate Probability Neural Network

1 code implementation21 Mar 2023 Tao Yang, Chuang Liu, Xiaofeng Ma, Weijia Lu, Ning Wu, Bingyang Li, Zhifei Yang, Peng Liu, Lin Sun, Xiaodong Zhang, Can Zhang

Besides, for our proposed neural network framework, the output of neural network is defined as probability events, and based on the statistical analysis of these events, the inference model for classification task is deduced.

Classification

PAGE: A Position-Aware Graph-Based Model for Emotion Cause Entailment in Conversation

1 code implementation3 Mar 2023 Xiaojie Gu, Renze Lou, Lin Sun, Shangxin Li

Conversational Causal Emotion Entailment (C2E2) is a task that aims at recognizing the causes corresponding to a target emotion in a conversation.

Causal Emotion Entailment Causal Inference +1

Prior land surface reflectance-based sandstorm detection from space using deep learning

no code implementations Frontiers in Earth Science 2022 Yu Qu, Lin Sun, Qing hua Su, Nan Ma, Zhi hui Wang, Xi rong Liu

Based on the dataset, the difference between the reflectance observed by the satellite and the corresponding LSR is generated, which is used as a characteristic parameter of sandstorm detection with the deep learning method.

Using EBGAN for Anomaly Intrusion Detection

no code implementations21 Jun 2022 Yi Cui, Wenfeng Shen, Jian Zhang, Weijia Lu, Chuang Liu, Lin Sun, Si Chen

The generator in IDS-EBGAN is responsible for converting the original malicious network traffic in the training set into adversarial malicious examples.

Intrusion Detection

VISTA: Boosting 3D Object Detection via Dual Cross-VIew SpaTial Attention

1 code implementation CVPR 2022 Shengheng Deng, Zhihao Liang, Lin Sun, Kui Jia

These multi-view methods either refine the proposals predicted from single view via fused features, or fuse the features without considering the global spatial context; their performance is limited consequently.

3D Object Detection Autonomous Driving +1

Learning Category-level Shape Saliency via Deep Implicit Surface Networks

no code implementations14 Dec 2020 Chaozheng Wu, Lin Sun, Xun Xu, Kui Jia

Given the large shape variations among different instances of a same category, we are formally interested in developing a quantity defined for individual points on a continuous object surface; the quantity specifies how individual surface points contribute to the formation of the shape as the category.

Point Cloud Classification Saliency Prediction

RIVA: A Pre-trained Tweet Multimodal Model Based on Text-image Relation for Multimodal NER

no code implementations COLING 2020 Lin Sun, Jiquan Wang, Yindu Su, Fangsheng Weng, Yuxuan Sun, Zengwei Zheng, Yuanyi Chen

In the multimodal NER task, the experimental results show the significance of text-related visual features for the visual-linguistic model and our approach achieves SOTA performance on the MNER datasets.

named-entity-recognition Named Entity Recognition +3

Grasp Proposal Networks: An End-to-End Solution for Visual Learning of Robotic Grasps

2 code implementations NeurIPS 2020 Chaozheng Wu, Jian Chen, Qiaoyu Cao, Jianchi Zhang, Yunxin Tai, Lin Sun, Kui Jia

To test GPNet, we contribute a synthetic dataset of 6-DOF object grasps; evaluation is conducted using rule-based criteria, simulation test, and real test.

Real-Time Uncertainty Estimation in Computer Vision via Uncertainty-Aware Distribution Distillation

no code implementations31 Jul 2020 Yichen Shen, Zhilu Zhang, Mert R. Sabuncu, Lin Sun

We propose a simple, easy-to-optimize distillation method for learning the conditional predictive distribution of a pre-trained dropout model for fast, sample-free uncertainty estimation in computer vision tasks.

Depth Estimation Semantic Segmentation +1

Probabilistic Multi-modal Trajectory Prediction with Lane Attention for Autonomous Vehicles

no code implementations6 Jul 2020 Chenxu Luo, Lin Sun, Dariush Dabiri, Alan Yuille

As for vehicles, their trajectories are significantly influenced by the lane geometry and how to effectively use the lane information is of active interest.

Autonomous Vehicles Motion Forecasting +1

HRDNet: High-resolution Detection Network for Small Objects

no code implementations13 Jun 2020 Ziming Liu, Guangyu Gao, Lin Sun, Zhiyuan Fang

By extracting various features from high to low resolutions, the MD-IPN is able to improve the performance of small object detection as well as maintaining the performance of middle and large objects.

Object object-detection +2

IPG-Net: Image Pyramid Guidance Network for Small Object Detection

no code implementations2 Dec 2019 Ziming Liu, Guangyu Gao, Lin Sun, Li Fang

In this paper, except for top-down combining of information for shallow layers, we propose a novel network called Image Pyramid Guidance Network (IPG-Net) to make sure both the spatial information and semantic information are abundant for each layer.

object-detection Small Object Detection

Combinatorial Keyword Recommendations for Sponsored Search with Deep Reinforcement Learning

no code implementations18 Jul 2019 Zhipeng Li, Jianwei Wu, Lin Sun, Tao Rong

In sponsored search, keyword recommendations help advertisers to achieve much better performance within limited budget.

Clustering Combinatorial Optimization +2

TOI-CNN: a Solution of Information Extraction on Chinese Insurance Policy

no code implementations NAACL 2019 Lin Sun, Kai Zhang, Fule Ji, Zhenhua Yang

The advantage of TOI pooling layer is that the nested elements from one sentence could share computation and context in the forward and backward passes.

Sentence

Coupled Recurrent Network (CRN)

no code implementations25 Dec 2018 Lin Sun, Kui Jia, Yuejia Shen, Silvio Savarese, Dit Yan Yeung, Bertram E. Shi

To learn from these heterogenous input sources, existing methods reply on two-stream architectural designs that contain independent, parallel streams of Recurrent Neural Networks (RNNs).

Action Recognition In Videos Multi-Person Pose Estimation +2

Lattice Long Short-Term Memory for Human Action Recognition

no code implementations ICCV 2017 Lin Sun, Kui Jia, Kevin Chen, Dit Yan Yeung, Bertram E. Shi, Silvio Savarese

This method effectively enhances the ability to model dynamics across time and addresses the non-stationary issue of long-term motion dynamics without significantly increasing the model complexity.

Action Recognition Optical Flow Estimation +1

Fine-Grained Categorization via CNN-Based Automatic Extraction and Integration of Object-Level and Part-Level Features

no code implementations22 Jun 2017 Ting Sun, Lin Sun, Dit-yan Yeung

Fine-grained categorization can benefit from part-based features which reveal subtle visual differences between object categories.

Object

Multilingual Metaphor Processing: Experiments with Semi-Supervised and Unsupervised Learning

no code implementations CL 2017 Ekaterina Shutova, Lin Sun, Elkin Dar{\'\i}o Guti{\'e}rrez, Patricia Lichtenstein, Srini Narayanan

We investigate different levels and types of supervision (learning from linguistic examples vs. learning from a given set of metaphorical mappings vs. learning without annotation) in flat and hierarchical, unconstrained and constrained clustering settings.

Constrained Clustering

Feedback Networks

1 code implementation CVPR 2017 Amir R. Zamir, Te-Lin Wu, Lin Sun, William Shen, Jitendra Malik, Silvio Savarese

Currently, the most successful learning models in computer vision are based on learning successive representations followed by a decision layer.

Human Action Recognition using Factorized Spatio-Temporal Convolutional Networks

no code implementations ICCV 2015 Lin Sun, Kui Jia, Dit-yan Yeung, Bertram E. Shi

Human actions in video sequences are three-dimensional (3D) spatio-temporal signals characterizing both the visual appearance and motion dynamics of the involved humans and objects.

Action Recognition Image Classification +1

DL-SFA: Deeply-Learned Slow Feature Analysis for Action Recognition

no code implementations CVPR 2014 Lin Sun, Kui Jia, Tsung-Han Chan, Yuqiang Fang, Gang Wang, Shuicheng Yan

In this paper, we propose to combine SFA with deep learning techniques to learn hierarchical representations from the video data itself.

Action Recognition Temporal Action Localization

Native Language Identification Using Large, Longitudinal Data

no code implementations LREC 2014 Xiao Jiang, Yufan Guo, Jeroen Geertzen, Dora Alexopoulou, Lin Sun, Anna Korhonen

Native Language Identification (NLI) is a task aimed at determining the native language (L1) of learners of second language (L2) on the basis of their written texts.

BIG-bench Machine Learning Native Language Identification +1

Cannot find the paper you are looking for? You can Submit a new open access paper.