Search Results for author: Sen yang

Found 79 papers, 33 papers with code

Artificial Intelligence-Enhanced Couinaud Segmentation for Precision Liver Cancer Therapy

no code implementations5 Nov 2024 Liang Qiu, Wenhao Chi, Xiaohan Xing, Praveenbalaji Rajendran, Mingjie Li, Yuming Jiang, Oscar Pastor-Serrano, Sen yang, Xiyue Wang, Yuanfeng Ji, Qiang Wen

Precision therapy for liver cancer necessitates accurately delineating liver sub-regions to protect healthy tissue while targeting tumors, which is essential for reducing recurrence and improving survival rates.

Data Augmentation Segmentation

Parameter-Efficient Fine-Tuning in Large Models: A Survey of Methodologies

no code implementations24 Oct 2024 Luping Wang, Sheng Chen, Linnan Jiang, Shu Pan, Runze Cai, Sen yang, Fei Yang

When adapting large models to specific downstream tasks, their massive parameter scale poses a significant challenge in fine-tuning on hardware platforms with limited computational power and GPU memory.

parameter-efficient fine-tuning Text Generation

LoGU: Long-form Generation with Uncertainty Expressions

1 code implementation18 Oct 2024 Ruihan Yang, Caiqi Zhang, Zhisong Zhang, Xinting Huang, Sen yang, Nigel Collier, Dong Yu, Deqing Yang

To tackle these challenges, we propose a refinement-based data collection framework and a two-stage training pipeline.

Instruction Following

Atomic Calibration of LLMs in Long-Form Generations

no code implementations17 Oct 2024 Caiqi Zhang, Ruihan Yang, Zhisong Zhang, Xinting Huang, Sen yang, Dong Yu, Nigel Collier

Existing research on LLM calibration has primarily focused on short-form tasks, providing a single confidence score at the response level (macro calibration).

ChatHouseDiffusion: Prompt-Guided Generation and Editing of Floor Plans

1 code implementation15 Oct 2024 Sizhong Qin, Chengyu He, Qiaoyun Chen, Sen yang, Wenjie Liao, Yi Gu, Xinzheng Lu

The generation and editing of floor plans are critical in architectural planning, requiring a high degree of flexibility and efficiency.

Conditional Image Generation

MGMapNet: Multi-Granularity Representation Learning for End-to-End Vectorized HD Map Construction

no code implementations10 Oct 2024 Jing Yang, Minyue Jiang, Sen yang, Xiao Tan, YingYing Li, Errui Ding, Hanli Wang, Jingdong Wang

The construction of Vectorized High-Definition (HD) map typically requires capturing both category and geometry information of map elements.

Representation Learning

Learning Multiple Probabilistic Decisions from Latent World Model in Autonomous Driving

1 code implementation24 Sep 2024 Lingyu Xiao, Jiang-Jiang Liu, Sen yang, Xiaofan Li, Xiaoqing Ye, Wankou Yang, Jingdong Wang

In this paper, we explore the feasibility of deriving decisions from an autoregressive world model by addressing these challenges through the formulation of multiple probabilistic hypotheses.

Autonomous Driving Imitation Learning +1

Not All Preference Pairs Are Created Equal: A Recipe for Annotation-Efficient Iterative Preference Learning

no code implementations25 Jun 2024 Sen yang, Leyang Cui, Deng Cai, Xinting Huang, Shuming Shi, Wai Lam

Iterative preference learning, though yielding superior performances, requires online annotated preference labels.

CMTNet: Convolutional Meets Transformer Network for Hyperspectral Images Classification

no code implementations20 Jun 2024 Faxu Guo, Quan Feng, Sen yang, Wanxia Yang

Hyperspectral remote sensing (HIS) enables the detailed capture of spectral information from the Earth's surface, facilitating precise classification and identification of surface crops due to its superior spectral diagnostic capabilities.

Classification Crop Classification

MaskMatch: Boosting Semi-Supervised Learning Through Mask Autoencoder-Driven Feature Learning

no code implementations10 May 2024 Wenjin Zhang, Keyi Li, Sen yang, Chenyang Gao, Wanzhao Yang, Sifan Yuan, Ivan Marsic

To overcome this limitation and improve SSL performance, we introduce \algo, a novel algorithm that fully utilizes unlabeled data to boost semi-supervised learning.

Representation Learning Self-Supervised Learning

2L3: Lifting Imperfect Generated 2D Images into Accurate 3D

no code implementations29 Jan 2024 Yizheng Chen, Rengan Xie, Qi Ye, Sen yang, Zixuan Xie, Tianxiao Chen, Rong Li, Yuchi Huo

Specifically, we first leverage to decouple the shading information from the generated images to reduce the impact of inconsistent lighting; then, we introduce mono prior with view-dependent transient encoding to enhance the reconstructed normal; and finally, we design a view augmentation fusion strategy that minimizes pixel-level loss in generated sparse views and semantic loss in augmented random views, resulting in view-consistent geometry and detailed textures.

3D Object Reconstruction 3D Reconstruction +1

Multi-Candidate Speculative Decoding

1 code implementation12 Jan 2024 Sen yang, ShuJian Huang, Xinyu Dai, Jiajun Chen

One way to speed them up is speculative decoding, which generates candidate segments (a sequence of tokens) from a fast draft model that is then verified in parallel by the target model.

SeaLLMs -- Large Language Models for Southeast Asia

1 code implementation1 Dec 2023 Xuan-Phi Nguyen, Wenxuan Zhang, Xin Li, Mahani Aljunied, Zhiqiang Hu, Chenhui Shen, Yew Ken Chia, Xingxuan Li, Jianyu Wang, Qingyu Tan, Liying Cheng, Guanzheng Chen, Yue Deng, Sen yang, Chaoqun Liu, Hang Zhang, Lidong Bing

Despite the remarkable achievements of large language models (LLMs) in various tasks, there remains a linguistic bias that favors high-resource languages, such as English, often at the expense of low-resource and regional languages.

Instruction Following

Once Upon a $\textit{Time}$ in $\textit{Graph}$: Relative-Time Pretraining for Complex Temporal Reasoning

1 code implementation23 Oct 2023 Sen yang, Xin Li, Lidong Bing, Wai Lam

However, the knowledge-time association is usually insufficient for the downstream tasks that require reasoning over temporal dependencies between knowledge.

Question Answering

ADNet: Lane Shape Prediction via Anchor Decomposition

2 code implementations ICCV 2023 Lingyu Xiao, Xiang Li, Sen yang, Wankou Yang

In this paper, we revisit the limitations of anchor-based lane detection methods, which have predominantly focused on fixed anchors that stem from the edges of the image, disregarding their versatility and quality.

Lane Detection

SNR-based beaconless multi-scan link acquisition model with vibration for LEO-to-ground laser communication

no code implementations6 Aug 2023 Sen yang, Xiaofeng Li

Specially, under the combined effects of platform vibration and turbulence, we decouple the parameters of beam divergence angle, spiral pitch, and coverage factor at a fixed transmitted power for a given average received SNR threshold.

Enhancing Grammatical Error Correction Systems with Explanations

1 code implementation25 May 2023 Yuejiao Fei, Leyang Cui, Sen yang, Wai Lam, Zhenzhong Lan, Shuming Shi

Grammatical error correction systems improve written communication by detecting and correcting language mistakes.

Grammatical Error Correction

Why is the winner the best?

no code implementations CVPR 2023 Matthias Eisenmann, Annika Reinke, Vivienn Weru, Minu Dietlinde Tizabi, Fabian Isensee, Tim J. Adler, Sharib Ali, Vincent Andrearczyk, Marc Aubreville, Ujjwal Baid, Spyridon Bakas, Niranjan Balu, Sophia Bano, Jorge Bernal, Sebastian Bodenstedt, Alessandro Casella, Veronika Cheplygina, Marie Daum, Marleen de Bruijne, Adrien Depeursinge, Reuben Dorent, Jan Egger, David G. Ellis, Sandy Engelhardt, Melanie Ganz, Noha Ghatwary, Gabriel Girard, Patrick Godau, Anubha Gupta, Lasse Hansen, Kanako Harada, Mattias Heinrich, Nicholas Heller, Alessa Hering, Arnaud Huaulmé, Pierre Jannin, Ali Emre Kavur, Oldřich Kodym, Michal Kozubek, Jianning Li, Hongwei Li, Jun Ma, Carlos Martín-Isla, Bjoern Menze, Alison Noble, Valentin Oreiller, Nicolas Padoy, Sarthak Pati, Kelly Payette, Tim Rädsch, Jonathan Rafael-Patiño, Vivek Singh Bawa, Stefanie Speidel, Carole H. Sudre, Kimberlin Van Wijnen, Martin Wagner, Donglai Wei, Amine Yamlahi, Moi Hoon Yap, Chun Yuan, Maximilian Zenk, Aneeq Zia, David Zimmerer, Dogu Baran Aydogan, Binod Bhattarai, Louise Bloch, Raphael Brüngel, Jihoon Cho, Chanyeol Choi, Qi Dou, Ivan Ezhov, Christoph M. Friedrich, Clifton Fuller, Rebati Raman Gaire, Adrian Galdran, Álvaro García Faura, Maria Grammatikopoulou, SeulGi Hong, Mostafa Jahanifar, Ikbeom Jang, Abdolrahim Kadkhodamohammadi, Inha Kang, Florian Kofler, Satoshi Kondo, Hugo Kuijf, Mingxing Li, Minh Huan Luu, Tomaž Martinčič, Pedro Morais, Mohamed A. Naser, Bruno Oliveira, David Owen, Subeen Pang, Jinah Park, Sung-Hong Park, Szymon Płotka, Elodie Puybareau, Nasir Rajpoot, Kanghyun Ryu, Numan Saeed, Adam Shephard, Pengcheng Shi, Dejan Štepec, Ronast Subedi, Guillaume Tochon, Helena R. Torres, Helene Urien, João L. Vilaça, Kareem Abdul Wahid, Haojie Wang, Jiacheng Wang, Liansheng Wang, Xiyue Wang, Benedikt Wiestler, Marek Wodzinski, Fangfang Xia, Juanying Xie, Zhiwei Xiong, Sen yang, Yanwu Yang, Zixuan Zhao, Klaus Maier-Hein, Paul F. Jäger, Annette Kopp-Schneider, Lena Maier-Hein

The "typical" lead of a winning team is a computer scientist with a doctoral degree, five years of experience in biomedical image analysis, and four years of experience in deep learning.

Benchmarking Multi-Task Learning

CoNIC Challenge: Pushing the Frontiers of Nuclear Detection, Segmentation, Classification and Counting

1 code implementation11 Mar 2023 Simon Graham, Quoc Dang Vu, Mostafa Jahanifar, Martin Weigert, Uwe Schmidt, Wenhua Zhang, Jun Zhang, Sen yang, Jinxi Xiang, Xiyue Wang, Josef Lorenz Rumberger, Elias Baumann, Peter Hirsch, Lihao Liu, Chenyang Hong, Angelica I. Aviles-Rivero, Ayushi Jain, Heeyoung Ahn, Yiyu Hong, Hussam Azzuni, Min Xu, Mohammad Yaqub, Marie-Claire Blache, Benoît Piégu, Bertrand Vernay, Tim Scherr, Moritz Böhland, Katharina Löffler, Jiachen Li, Weiqin Ying, Chixin Wang, Dagmar Kainmueller, Carola-Bibiane Schönlieb, Shuolin Liu, Dhairya Talsania, Yughender Meda, Prakash Mishra, Muhammad Ridzuan, Oliver Neumann, Marcel P. Schilling, Markus Reischl, Ralf Mikut, Banban Huang, Hsiang-Chin Chien, Ching-Ping Wang, Chia-Yen Lee, Hong-Kun Lin, Zaiyi Liu, Xipeng Pan, Chu Han, Jijun Cheng, Muhammad Dawood, Srijay Deshpande, Raja Muhammad Saad Bashir, Adam Shephard, Pedro Costa, João D. Nunes, Aurélio Campilho, Jaime S. Cardoso, Hrishikesh P S, Densen Puthussery, Devika R G, Jiji C V, Ye Zhang, Zijie Fang, Zhifan Lin, Yongbing Zhang, Chunhui Lin, Liukun Zhang, Lijian Mao, Min Wu, Vi Thi-Tuong Vo, Soo-Hyung Kim, Taebum Lee, Satoshi Kondo, Satoshi Kasai, Pranay Dumbhare, Vedant Phuse, Yash Dubey, Ankush Jamthikar, Trinh Thi Le Vuong, Jin Tae Kwak, Dorsa Ziaei, Hyun Jung, Tianyi Miao, David Snead, Shan E Ahmed Raza, Fayyaz Minhas, Nasir M. Rajpoot

Nuclear detection, segmentation and morphometric profiling are essential in helping us further understand the relationship between histology and patient outcome.

Nuclear Segmentation Segmentation +2

Federated attention consistent learning models for prostate cancer diagnosis and Gleason grading

no code implementations13 Feb 2023 Fei Kong, Xiyue Wang, Jinxi Xiang, Sen yang, Xinran Wang, Meng Yue, Jun Zhang, Junhan Zhao, Xiao Han, Yuhan Dong, Biyue Zhu, Fang Wang, Yueping Liu

We assessed the effectiveness of FACL in cancer diagnosis and Gleason grading tasks using 19, 461 whole-slide images of prostate cancer from multiple centers.

Contrastive Learning Ethics +2

Artificial intelligence for diagnosing and predicting survival of patients with renal cell carcinoma: Retrospective multi-center study

no code implementations12 Jan 2023 Siteng Chen, Xiyue Wang, Jun Zhang, Liren Jiang, Ning Zhang, Feng Gao, Wei Yang, Jinxi Xiang, Sen yang, Junhua Zheng, Xiao Han

The OSrisk for the prediction of 5-year survival status achieved AUC of 0. 784 (0. 746-0. 819) in the TCGA cohort, which was further verified in the independent General cohort and the CPTAC cohort, with AUC of 0. 774 (0. 723-0. 820) and 0. 702 (0. 632-0. 765), respectively.

whole slide images

DeepNoise: Signal and Noise Disentanglement based on Classifying Fluorescent Microscopy Images via Deep Learning

1 code implementation13 Sep 2022 Sen yang, Tao Shen, Yuqi Fang, Xiyue Wang, Jun Zhang, Wei Yang, Junzhou Huang, Xiao Han

The high-content image-based assay is commonly leveraged for identifying the phenotypic impact of genetic perturbations in biology field.

Disentanglement Drug Discovery +1

LPFS: Learnable Polarizing Feature Selection for Click-Through Rate Prediction

1 code implementation1 Jun 2022 Yi Guo, Zhaocheng Liu, Jianchao Tan, Chao Liao, Sen yang, Lei Yuan, Dongying Kong, Zhi Chen, Ji Liu

When training is finished, some gates are exact zero, while others are around one, which is particularly favored by the practical hot-start training in the industry, due to no damage to the model performance before and after removing the features corresponding to exact-zero gates.

Click-Through Rate Prediction feature selection

Nuclear Norm Maximization Based Curiosity-Driven Learning

no code implementations21 May 2022 Chao Chen, Zijian Gao, Kele Xu, Sen yang, Yiying Li, Bo Ding, Dawei Feng, Huaimin Wang

To handle the sparsity of the extrinsic rewards in reinforcement learning, researchers have proposed intrinsic reward which enables the agent to learn the skills that might come in handy for pursuing the rewards in the future, such as encouraging the agent to visit novel states.

Atari Games

Deep learning-based approach to reveal tumor mutational burden status from whole slide images across multiple cancer types

no code implementations7 Apr 2022 Siteng Chen, Jinxi Xiang, Xiyue Wang, Jun Zhang, Sen yang, Junzhou Huang, Wei Yang, Junhua Zheng, Xiao Han

MC-TMB algorithm also exhibited good generalization on the external validation cohort with an AUC of 0. 732 (0. 683-0. 761), and better performance when compared to other methods.

whole slide images

Generating Privacy-Preserving Process Data with Deep Generative Models

no code implementations15 Mar 2022 Keyi Li, Sen yang, Travis M. Sullivan, Randall S. Burd, Ivan Marsic

We experimented with different models of representation learning and used the learned model to generate synthetic process data.

Privacy Preserving Representation Learning

Do Prompts Solve NLP Tasks Using Natural Language?

no code implementations2 Mar 2022 Sen yang, Yunchen Zhang, Leyang Cui, Yue Zhang

Thanks to the advanced improvement of large pre-trained language models, prompt-based fine-tuning is shown to be effective on a variety of downstream tasks.

Adversarial Contrastive Self-Supervised Learning

no code implementations26 Feb 2022 Wentao Zhu, Hang Shang, Tingxun Lv, Chao Liao, Sen yang, Ji Liu

Recently, learning from vast unlabeled data, especially self-supervised learning, has been emerging and attracted widespread attention.

Self-Supervised Learning Triplet

Coverage-Guided Tensor Compiler Fuzzing with Joint IR-Pass Mutation

1 code implementation21 Feb 2022 Jiawei Liu, Yuxiang Wei, Sen yang, Yinlin Deng, Lingming Zhang

Our results show that Tzer substantially outperforms existing fuzzing techniques on tensor compiler testing, with 75% higher coverage and 50% more valuable tests than the 2nd-best technique.

Multi-task Pre-training Language Model for Semantic Network Completion

1 code implementation13 Jan 2022 Da Li, Sen yang, Kele Xu, Ming Yi, Yukai He, Huaimin Wang

To demonstrate the effectiveness of our method, we conduct extensive experiments on three widely-used datasets, WN18RR, FB15k-237, and UMLS.

Contrastive Learning Data Augmentation +3

Node-Aligned Graph Convolutional Network for Whole-Slide Image Representation and Classification

1 code implementation CVPR 2022 Yonghang Guan, Jun Zhang, Kuan Tian, Sen yang, Pei Dong, Jinxi Xiang, Wei Yang, Junzhou Huang, Yuyao Zhang, Xiao Han

In this paper, we propose a hierarchical global-to-local clustering strategy to build a Node-Aligned GCN (NAGCN) to represent WSI with rich local structural information as well as global distribution.

Clustering graph construction +2

TNASP: A Transformer-based NAS Predictor with a Self-evolution Framework

no code implementations NeurIPS 2021 Shun Lu, Jixiang Li, Jianchao Tan, Sen yang, Ji Liu

Predictor-based Neural Architecture Search (NAS) continues to be an important topic because it aims to mitigate the time-consuming search procedure of traditional NAS methods.

Neural Architecture Search

Persia: An Open, Hybrid System Scaling Deep Learning-based Recommenders up to 100 Trillion Parameters

1 code implementation10 Nov 2021 Xiangru Lian, Binhang Yuan, XueFeng Zhu, Yulong Wang, Yongjun He, Honghuan Wu, Lei Sun, Haodong Lyu, Chengjun Liu, Xing Dong, Yiqiao Liao, Mingnan Luo, Congfei Zhang, Jingru Xie, Haonan Li, Lei Chen, Renjie Huang, Jianying Lin, Chengchun Shu, Xuezhong Qiu, Zhishan Liu, Dongying Kong, Lei Yuan, Hai Yu, Sen yang, Ce Zhang, Ji Liu

Specifically, in order to ensure both the training efficiency and the training accuracy, we design a novel hybrid training algorithm, where the embedding layer and the dense neural network are handled by different synchronization mechanisms; then we build a system called Persia (short for parallel recommendation training system with hybrid acceleration) to support this hybrid training algorithm.

Recommendation Systems

Joint Channel and Weight Pruning for Model Acceleration on Moblie Devices

1 code implementation15 Oct 2021 Tianli Zhao, Xi Sheryl Zhang, Wentao Zhu, Jiaxing Wang, Sen yang, Ji Liu, Jian Cheng

In this paper, we present a unified framework with Joint Channel pruning and Weight pruning (JCW), and achieves a better Pareto-frontier between the latency and accuracy than previous model compression approaches.

Model Compression

Investigating Non-local Features for Neural Constituency Parsing

1 code implementation ACL 2022 Leyang Cui, Sen yang, Yue Zhang

Besides, our method achieves state-of-the-art BERT-based performance on PTB (95. 92 F1) and strong performance on CTB (92. 31 F1).

Constituency Parsing

Deep Learning Model for Demodulation Reference Signal based Channel Estimation

no code implementations22 Sep 2021 Yu Tian, Chengguang Li, Sen yang

In this paper, we propose a deep learning model for Demodulation Reference Signal (DMRS) based channel estimation task.

Deep Learning

SpeechNAS: Towards Better Trade-off between Latency and Accuracy for Large-Scale Speaker Verification

1 code implementation18 Sep 2021 Wentao Zhu, Tianlong Kong, Shun Lu, Jixiang Li, Dawei Zhang, Feng Deng, Xiaorui Wang, Sen yang, Ji Liu

Recently, x-vector has been a successful and popular approach for speaker verification, which employs a time delay neural network (TDNN) and statistics pooling to extract speaker characterizing embedding from variable-length utterances.

Neural Architecture Search Speaker Recognition +2

GDP: Stabilized Neural Network Pruning via Gates with Differentiable Polarization

no code implementations ICCV 2021 Yi Guo, Huan Yuan, Jianchao Tan, Zhangyang Wang, Sen yang, Ji Liu

During the training process, the polarization effect will drive a subset of gates to smoothly decrease to exact zero, while other gates gradually stay away from zero by a large margin.

Model Compression Network Pruning

Sk-Unet Model with Fourier Domain for Mitosis Detection

no code implementations1 Sep 2021 Sen yang, Feng Luo, Jun Zhang, Xiyue Wang

Mitotic count is the most important morphological feature of breast cancer grading.

Mitosis Detection Segmentation

Shifted Chunk Transformer for Spatio-Temporal Representational Learning

no code implementations NeurIPS 2021 Xuefan Zha, Wentao Zhu, Tingxun Lv, Sen yang, Ji Liu

However, the pure-Transformer based spatio-temporal learning can be prohibitively costly on memory and computation to extract fine-grained features from a tiny patch.

Action Anticipation Action Recognition +4

PASTO: Strategic Parameter Optimization in Recommendation Systems -- Probabilistic is Better than Deterministic

no code implementations20 Aug 2021 Weicong Ding, Hanlin Tang, Jingshuo Feng, Lei Yuan, Sen yang, Guangxu Yang, Jie Zheng, Jing Wang, Qiang Su, Dong Zheng, Xuezhong Qiu, Yongqi Liu, Yuxuan Chen, Yang Liu, Chao Song, Dongying Kong, Kai Ren, Peng Jiang, Qiao Lian, Ji Liu

In this setting with multiple and constrained goals, this paper discovers that a probabilistic strategic parameter regime can achieve better value compared to the standard regime of finding a single deterministic parameter.

Recommendation Systems

POSO: Personalized Cold Start Modules for Large-scale Recommender Systems

no code implementations10 Aug 2021 Shangfeng Dai, Haobin Lin, Zhichen Zhao, Jianying Lin, Honghuan Wu, Zhe Wang, Sen yang, Ji Liu

Moreover, POSO can be further generalized to regular users, inactive users and returning users (+2%-3% on Watch Time), as well as item cold start (+3. 8% on Watch Time).

Recommendation Systems

FINT: Field-aware INTeraction Neural Network For CTR Prediction

1 code implementation5 Jul 2021 Zhishan Zhao, Sen yang, Guohui Liu, Dawei Feng, Kele Xu

As a critical component for online advertising and marking, click-through rate (CTR) prediction has draw lots of attentions from both industry and academia field.

Click-Through Rate Prediction

Template-Based Named Entity Recognition Using BART

1 code implementation Findings (ACL) 2021 Leyang Cui, Yu Wu, Jian Liu, Sen yang, Yue Zhang

To address the issue, we propose a template-based method for NER, treating NER as a language model ranking problem in a sequence-to-sequence framework, where original sentences and statement templates filled by candidate named entity span are regarded as the source sequence and the target sequence, respectively.

Few-shot NER Language Modelling +2

TokenPose: Learning Keypoint Tokens for Human Pose Estimation

1 code implementation ICCV 2021 YanJie Li, Shoukui Zhang, Zhicheng Wang, Sen yang, Wankou Yang, Shu-Tao Xia, Erjin Zhou

Most existing CNN-based methods do well in visual representation, however, lacking in the ability to explicitly learn the constraint relationships between keypoints.

Pose Estimation

TransPose: Keypoint Localization via Transformer

1 code implementation ICCV 2021 Sen yang, Zhibin Quan, Mu Nie, Wankou Yang

While CNN-based models have made remarkable progress on human pose estimation, what spatial dependencies they capture to localize keypoints remains unclear.

Ranked #3 on Pose Estimation on OCHuman (Validation AP metric)

Keypoint Detection Multi-Person Pose Estimation

Ensemble Chinese End-to-End Spoken Language Understanding for Abnormal Event Detection from audio stream

no code implementations19 Oct 2020 Haoran Wei, Fei Tao, Runze Su, Sen yang, Ji Liu

Previous end-to-end SLU models are primarily used for English environment due to lacking large scale SLU dataset in Chines, and use only one ASR model to extract features from speech.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +4

What Have We Achieved on Text Summarization?

1 code implementation EMNLP 2020 Dandan Huang, Leyang Cui, Sen yang, Guangsheng Bao, Kun Wang, Jun Xie, Yue Zhang

Deep learning has led to significant improvement in text summarization with various methods investigated and improved ROUGE scores reported over the years.

Text Summarization

Leveraging Adversarial Training in Self-Learning for Cross-Lingual Text Classification

no code implementations29 Jul 2020 Xin Dong, Yaxin Zhu, Yupeng Zhang, Zuohui Fu, Dongkuan Xu, Sen yang, Gerard de Melo

The resulting model then serves as a teacher to induce labels for unlabeled target language samples that can be used during further adversarial training, allowing us to gradually adapt our model to the target language.

General Classification intent-classification +4

Event Arguments Extraction via Dilate Gated Convolutional Neural Network with Enhanced Local Features

no code implementations2 Jun 2020 Zhigang Kan, Linbo Qiao, Sen yang, Feng Liu, Feng Huang

However, the F-Score of event arguments extraction is much lower than that of event trigger extraction, i. e. in the most recent work, event trigger extraction achieves 80. 7%, while event arguments extraction achieves only 58%.

Event Extraction

Pose Neural Fabrics Search

2 code implementations16 Sep 2019 Sen Yang, Wankou Yang, Zhen Cui

Neural Architecture Search (NAS) technologies have emerged in many domains to jointly learn the architectures and weights of the neural network.

Image Classification Keypoint Detection +3

Exploring Pre-trained Language Models for Event Extraction and Generation

no code implementations ACL 2019 Sen Yang, Dawei Feng, Linbo Qiao, Zhigang Kan, Dongsheng Li

Traditional approaches to the task of ACE event extraction usually depend on manually annotated data, which is often laborious to create and limited in size.

Event Extraction General Classification

On the Linear Speedup Analysis of Communication Efficient Momentum SGD for Distributed Non-Convex Optimization

no code implementations9 May 2019 Hao Yu, Rong Jin, Sen yang

Recent developments on large-scale distributed machine learning applications, e. g., deep neural networks, benefit enormously from the advances in distributed non-convex optimization techniques, e. g., distributed Stochastic Gradient Descent (SGD).

BIG-bench Machine Learning

Shrinking the Upper Confidence Bound: A Dynamic Product Selection Problem for Urban Warehouses

no code implementations19 Mar 2019 Rong Jin, David Simchi-Levi, Li Wang, Xinshang Wang, Sen Yang

In this paper, we study algorithms for dynamically identifying a large number of products (i. e., SKUs) with top customer purchase probabilities on the fly, from an ocean of potential products to offer on retailers' ultra-fast delivery platforms.

Parallel Restarted SGD with Faster Convergence and Less Communication: Demystifying Why Model Averaging Works for Deep Learning

no code implementations17 Jul 2018 Hao Yu, Sen yang, Shenghuo Zhu

Ideally, parallel mini-batch SGD can achieve a linear speed-up of the training time (with respect to the number of workers) compared with SGD over a single worker.

Learning with Non-Convex Truncated Losses by SGD

no code implementations21 May 2018 Yi Xu, Shenghuo Zhu, Sen yang, Chi Zhang, Rong Jin, Tianbao Yang

Learning with a {\it convex loss} function has been a dominating paradigm for many years.

Process-oriented Iterative Multiple Alignment for Medical Process Mining

no code implementations16 Sep 2017 Shuhong Chen, Sen yang, Moliang Zhou, Randall S. Burd, Ivan Marsic

We applied PIMA to analyzing medical workflow data, showing how iterative alignment can better represent the data and facilitate the extraction of insights from data visualization.

Data Visualization

Multi-task Vector Field Learning

no code implementations NeurIPS 2012 Binbin Lin, Sen yang, Chiyuan Zhang, Jieping Ye, Xiaofei He

MTVFL has the following key properties: (1) the vector fields we learned are close to the gradient fields of the prediction functions; (2) within each task, the vector field is required to be as parallel as possible which is expected to span a low dimensional subspace; (3) the vector fields from all tasks share a low dimensional subspace.

Multi-Task Learning

Fused Multiple Graphical Lasso

no code implementations10 Sep 2012 Sen Yang, Zhaosong Lu, Xiaotong Shen, Peter Wonka, Jieping Ye

We expect the two brain networks for NC and MCI to share common structures but not to be identical to each other; similarly for the two brain networks for MCI and AD.

Cannot find the paper you are looking for? You can Submit a new open access paper.