Search Results for author: Yu Shi

Found 55 papers, 19 papers with code

基于义原表示学习的词向量表示方法(Word Representation based on Sememe Representation Learning)

no code implementations CCL 2021 Ning Yu, Jiangping Wang, Yu Shi, Jianyi Liu

“本文利用知网(HowNet)中的知识, 并将Word2vec模型的结构和思想迁移至义原表示学习过程中, 提出了一个基于义原表示学习的词向量表示方法。首先, 本文利用OpenHowNet获取义原知识库中的所有义原、所有中文词汇以及所有中文词汇和其对应的义原集合, 作为实验的数据集。然后, 基于Skip-gram模型, 训练义原表示学习模型, 进而获得词向量。最后, 通过词相似度任务、词义消歧任务、词汇类比和观察最近邻义原, 来评价本文提出的方法获取的词向量的效果。通过和基线模型比较, 发现本文提出的方法既高效又准确, 不依赖大规模语料也不需要复杂的网络结构和繁多的参数, 也能提升各种自然语言处理任务的准确率。”

Representation Learning

Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations

1 code implementation27 Feb 2024 Jiaqi Zhai, Lucy Liao, Xing Liu, Yueming Wang, Rui Li, Xuan Cao, Leon Gao, Zhaojie Gong, Fangda Gu, Michael He, Yinghai Lu, Yu Shi

Large-scale recommendation systems are characterized by their reliance on high cardinality, heterogeneous features and the need to handle tens of billions of user actions on a daily basis.

 Ranked #1 on Recommendation Systems on MovieLens 20M (HR@10 (full corpus) metric)

Recommendation Systems

Robust Representation Learning for Unreliable Partial Label Learning

no code implementations31 Aug 2023 Yu Shi, Dong-Dong Wu, Xin Geng, Min-Ling Zhang

This is known as Unreliable Partial Label Learning (UPLL) that introduces an additional complexity due to the inherent unreliability and ambiguity of partial labels, often resulting in a sub-optimal performance with existing methods.

Contrastive Learning Partial Label Learning +2

SLPT: Selective Labeling Meets Prompt Tuning on Label-Limited Lesion Segmentation

no code implementations9 Aug 2023 Fan Bai, Ke Yan, Xiaoyu Bai, Xinyu Mao, Xiaoli Yin, Jingren Zhou, Yu Shi, Le Lu, Max Q. -H. Meng

We evaluate our method on liver tumor segmentation and achieve state-of-the-art performance, outperforming traditional fine-tuning with only 6% of tunable parameters, also achieving 94% of full-data performance by labeling only 5% of the data.

Lesion Segmentation Tumor Segmentation +1

Improved Prognostic Prediction of Pancreatic Cancer Using Multi-Phase CT by Integrating Neural Distance and Texture-Aware Transformer

no code implementations1 Aug 2023 Hexin Dong, Jiawen Yao, Yuxing Tang, Mingze Yuan, Yingda Xia, Jian Zhou, Hong Lu, Jingren Zhou, Bin Dong, Le Lu, Li Zhang, Zaiyi Liu, Yu Shi, Ling Zhang

Pancreatic ductal adenocarcinoma (PDAC) is a highly lethal cancer in which the tumor-vascular involvement greatly affects the resectability and, thus, overall survival of patients.

Towards Predicting Equilibrium Distributions for Molecular Systems with Deep Learning

no code implementations8 Jun 2023 Shuxin Zheng, Jiyan He, Chang Liu, Yu Shi, Ziheng Lu, Weitao Feng, Fusong Ju, Jiaxi Wang, Jianwei Zhu, Yaosen Min, He Zhang, Shidi Tang, Hongxia Hao, Peiran Jin, Chi Chen, Frank Noé, Haiguang Liu, Tie-Yan Liu

In this paper, we introduce a novel deep learning framework, called Distributional Graphormer (DiG), in an attempt to predict the equilibrium distribution of molecular systems.

Breaking the Curse of Quality Saturation with User-Centric Ranking

no code implementations24 May 2023 Zhuokai Zhao, Yang Yang, Wenyu Wang, Chihuang Liu, Yu Shi, Wenjie Hu, Haotian Zhang, Shuang Yang

A key puzzle in search, ads, and recommendation is that the ranking model can only utilize a small portion of the vastly available user interaction data.

i-Code V2: An Autoregressive Generation Framework over Vision, Language, and Speech Data

no code implementations21 May 2023 ZiYi Yang, Mahmoud Khademi, Yichong Xu, Reid Pryzant, Yuwei Fang, Chenguang Zhu, Dongdong Chen, Yao Qian, Mei Gao, Yi-Ling Chen, Robert Gmyr, Naoyuki Kanda, Noel Codella, Bin Xiao, Yu Shi, Lu Yuan, Takuya Yoshioka, Michael Zeng, Xuedong Huang

The convergence of text, visual, and audio data is a key step towards human-like artificial intelligence, however the current Vision-Language-Speech landscape is dominated by encoder-only models which lack generative abilities.

Unreliable Partial Label Learning with Recursive Separation

no code implementations20 Feb 2023 Yu Shi, Ning Xu, Hua Yuan, Xin Geng

Therefore, a generalized PLL named Unreliable Partial Label Learning (UPLL) is proposed, in which the true label may not be in the candidate label set.

Partial Label Learning Weakly-supervised Learning

Efficient End-to-End Video Question Answering with Pyramidal Multimodal Transformer

1 code implementation4 Feb 2023 Min Peng, Chongyang Wang, Yu Shi, Xiang-Dong Zhou

This paper presents a new method for end-to-end Video Question Answering (VideoQA), aside from the current popularity of using large-scale pre-training with huge feature extractors.

Computational Efficiency Question Answering +4

A deep local attention network for pre-operative lymph node metastasis prediction in pancreatic cancer via multiphase CT imaging

no code implementations4 Jan 2023 Zhilin Zheng, Xu Fang, Jiawen Yao, Mengmeng Zhu, Le Lu, Lingyun Huang, Jing Xiao, Yu Shi, Hong Lu, Jianping Lu, Ling Zhang, Chengwei Shao, Yun Bian

Lymph node (LN) metastasis status is one of the most critical prognostic and cancer staging factors for patients with resectable pancreatic ductal adenocarcinoma (PDAC), or in general, for any types of solid malignant tumors.


Handling Trade-Offs in Speech Separation with Sparsely-Gated Mixture of Experts

no code implementations11 Nov 2022 Xiaofei Wang, Zhuo Chen, Yu Shi, Jian Wu, Naoyuki Kanda, Takuya Yoshioka

Employing a monaural speech separation (SS) model as a front-end for automatic speech recognition (ASR) involves balancing two kinds of trade-offs.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Quantized Training of Gradient Boosting Decision Trees

2 code implementations20 Jul 2022 Yu Shi, Guolin Ke, Zhuoming Chen, Shuxin Zheng, Tie-Yan Liu

Recent years have witnessed significant success in Gradient Boosting Decision Trees (GBDT) for a wide range of machine learning applications.


Dynamic Scene Deblurring Based on Continuous Cross-Layer Attention Transmission

no code implementations23 Jun 2022 Xia Hua, Mingxin Li, Junxiong Fei, Yu Shi, Jianguo Liu, Hanyu Hong

In most of these networks, only the features refined by the attention maps can be passed to the next layer and the attention maps of different layers are separated from each other, which does not make full use of the attention information from different layers in the CNN.


Multilevel Hierarchical Network with Multiscale Sampling for Video Question Answering

1 code implementation9 May 2022 Min Peng, Chongyang Wang, Yuan Gao, Yu Shi, Xiang-Dong Zhou

With a multiscale sampling, RMI iterates the interaction of appearance-motion information at each scale and the question embeddings to build the multilevel question-guided visual representations.

Question Answering Video Question Answering +1

Benchmarking Graphormer on Large-Scale Molecular Modeling Datasets

3 code implementations9 Mar 2022 Yu Shi, Shuxin Zheng, Guolin Ke, Yifei Shen, Jiacheng You, Jiyan He, Shengjie Luo, Chang Liu, Di He, Tie-Yan Liu

This technical note describes the recent updates of Graphormer, including architecture design modifications, and the adaption to 3D molecular dynamics simulation.

Benchmarking Graph Regression +1

An Empirical Study of Graphormer on Large-Scale Molecular Modeling Datasets

no code implementations28 Feb 2022 Yu Shi, Shuxin Zheng, Guolin Ke, Yifei Shen, Jiacheng You, Jiyan He, Shengjie Luo, Chang Liu, Di He, Tie-Yan Liu

This technical note describes the recent updates of Graphormer, including architecture design modifications, and the adaption to 3D molecular dynamics simulation.

Florence: A New Foundation Model for Computer Vision

1 code implementation22 Nov 2021 Lu Yuan, Dongdong Chen, Yi-Ling Chen, Noel Codella, Xiyang Dai, Jianfeng Gao, Houdong Hu, Xuedong Huang, Boxin Li, Chunyuan Li, Ce Liu, Mengchen Liu, Zicheng Liu, Yumao Lu, Yu Shi, Lijuan Wang, JianFeng Wang, Bin Xiao, Zhen Xiao, Jianwei Yang, Michael Zeng, Luowei Zhou, Pengchuan Zhang

Computer vision foundation models, which are trained on diverse, large-scale dataset and can be adapted to a wide range of downstream tasks, are critical for this mission to solve real-world computer vision applications.

Action Classification Action Recognition In Videos +12

Temporal Pyramid Transformer with Multimodal Interaction for Video Question Answering

1 code implementation10 Sep 2021 Min Peng, Chongyang Wang, Yuan Gao, Yu Shi, Xiang-Dong Zhou

Targeting these issues, this paper proposes a novel Temporal Pyramid Transformer (TPT) model with multimodal interaction for VideoQA.

Natural Language Understanding Question Answering +1

A Joint and Domain-Adaptive Approach to Spoken Language Understanding

no code implementations25 Jul 2021 Linhao Zhang, Yu Shi, Linjun Shou, Ming Gong, Houfeng Wang, Michael Zeng

In this paper, we attempt to bridge these two lines of research and propose a joint and domain adaptive approach to SLU.

Domain Adaptation Intent Detection +3

Research on Portfolio Liquidation Strategy under Discrete Times

no code implementations29 Mar 2021 Qixuan Luo, Yu Shi, Handong Li

The permanent impact generated by an asset in the portfolio during the liquidation will affect all assets, and the temporary impact generated by one asset will only affect itself.

Generating Human Readable Transcript for Automatic Speech Recognition with Pre-trained Language Model

no code implementations22 Feb 2021 Junwei Liao, Yu Shi, Ming Gong, Linjun Shou, Sefik Eskimez, Liyang Lu, Hong Qu, Michael Zeng

Many downstream tasks and human readers rely on the output of the ASR system; therefore, errors introduced by the speaker and ASR system alike will be propagated to the next task in the pipeline.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Improving Zero-shot Neural Machine Translation on Language-specific Encoders-Decoders

no code implementations12 Feb 2021 Junwei Liao, Yu Shi, Ming Gong, Linjun Shou, Hong Qu, Michael Zeng

However, the performance of using multiple encoders and decoders on zero-shot translation still lags behind universal NMT.

Denoising Machine Translation +2

Speech-language Pre-training for End-to-end Spoken Language Understanding

no code implementations11 Feb 2021 Yao Qian, Ximo Bian, Yu Shi, Naoyuki Kanda, Leo Shen, Zhen Xiao, Michael Zeng

End-to-end (E2E) spoken language understanding (SLU) can infer semantics directly from speech signal without cascading an automatic speech recognizer (ASR) with a natural language understanding (NLU) module.

Ranked #3 on Spoken Language Understanding on Fluent Speech Commands (using extra training data)

Language Modelling Natural Language Understanding +1

Deterministic generation of multidimensional photonic cluster states using time-delay feedback

no code implementations19 Jan 2021 Yu Shi, Edo Waks

Cluster states are useful in many quantum information processing applications.

Quantum Physics

A Novel Method for Inference of Acyclic Chemical Compounds with Bounded Branch-height Based on Artificial Neural Networks and Integer Programming

1 code implementation21 Sep 2020 Naveed Ahmed Azam, Jianshen Zhu, Yanming Sun, Yu Shi, Aleksandar Shurbevski, Liang Zhao, Hiroshi Nagamochi, Tatsuya Akutsu

In the second phase, given a target value $y^*$ of property $\pi$, a feature vector $x^*$ is inferred by solving an MILP formulated from the trained ANN so that $\psi(x^*)$ is close to $y^*$ and then a set of chemical structures $G^*$ such that $f(G^*)= x^*$ is enumerated by a graph search algorithm.

Data Structures and Algorithms Computational Engineering, Finance, and Science 05C92, 92E10, 05C30, 68T07, 90C11, 92-04

Recognizing Micro-Expression in Video Clip with Adaptive Key-Frame Mining

1 code implementation19 Sep 2020 Min Peng, Chongyang Wang, Yuan Gao, Tao Bi, Tong Chen, Yu Shi, Xiang-Dong Zhou

As a spontaneous expression of emotion on face, micro-expression reveals the underlying emotion that cannot be controlled by human.

DeepPrognosis: Preoperative Prediction of Pancreatic Cancer Survival and Surgical Margin via Contrast-Enhanced CT Imaging

no code implementations26 Aug 2020 Jiawen Yao, Yu Shi, Le Lu, Jing Xiao, Ling Zhang

We present a multi-task CNN to accomplish both tasks of outcome and margin prediction where the network benefits from learning the tumor resection margin related features to improve survival prediction.

Survival Analysis Survival Prediction

Deep learning to estimate the physical proportion of infected region of lung for COVID-19 pneumonia with CT image set

no code implementations9 Jun 2020 Wei Wu, Yu Shi, Xukun Li, Yukun Zhou, Peng Du, Shuangzhi Lv, Tingbo Liang, Jifang Sheng

For the segmented masks of intact lung and infected regions, the best method could achieve 0. 972 and 0. 757 measure in mean Dice similarity coefficient on our test benchmark.

Computed Tomography (CT)

Improving Readability for Automatic Speech Recognition Transcription

no code implementations9 Apr 2020 Junwei Liao, Sefik Emre Eskimez, Liyang Lu, Yu Shi, Ming Gong, Linjun Shou, Hong Qu, Michael Zeng

In this work, we propose a novel NLP task called ASR post-processing for readability (APR) that aims to transform the noisy ASR output into a readable text for humans and downstream tasks while maintaining the semantic meaning of the speaker.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

BERT-AL: BERT for Arbitrarily Long Document Understanding

no code implementations ICLR 2020 Ruixuan Zhang, Zhuoyu Wei, Yu Shi, Yining Chen

When we apply BERT to long text tasks, e. g., document-level text summarization: 1) Truncating inputs by the maximum sequence length will decrease performance, since the model cannot capture long dependency and global information ranging the whole document.

document understanding Text Summarization

Meta-Graph Based HIN Spectral Embedding: Methods, Analyses, and Insights

no code implementations29 Sep 2019 Carl Yang, Yichen Feng, Pan Li, Yu Shi, Jiawei Han

In this work, we propose to study the utility of different meta-graphs, as well as how to simultaneously leverage multiple meta-graphs for HIN embedding in an unsupervised manner.

Discovering Hypernymy in Text-Rich Heterogeneous Information Network by Exploiting Context Granularity

1 code implementation4 Sep 2019 Yu Shi, Jiaming Shen, Yuchen Li, Naijing Zhang, Xinwei He, Zhengzhi Lou, Qi Zhu, Matthew Walker, Myunghwan Kim, Jiawei Han

Extensive experiments on two large real-world datasets demonstrate the effectiveness of HyperMine and the utility of modeling context granularity.

Knowledge Graphs

A Novel Apex-Time Network for Cross-Dataset Micro-Expression Recognition

1 code implementation7 Apr 2019 Min Peng, Chongyang Wang, Tao Bi, Tong Chen, Xiangdong Zhou, Yu Shi

As researchers working on such topics are moving to learn from the nature of micro-expression, the practice of using deep learning techniques has evolved from processing the entire video clip of micro-expression to the recognition on apex frame.

Micro Expression Recognition Micro-Expression Recognition

Easing Embedding Learning by Comprehensive Transcription of Heterogeneous Information Networks

1 code implementation10 Jul 2018 Yu Shi, Qi Zhu, Fang Guo, Chao Zhang, Jiawei Han

To cope with the challenges in the comprehensive transcription of HINs, we propose the HEER algorithm, which embeds HINs via edge representations that are further coupled with properly-learned heterogeneous metrics.

Feature Engineering Network Embedding

Training of photonic neural networks through in situ backpropagation

no code implementations25 May 2018 Tyler W. Hughes, Momchil Minkov, Yu Shi, Shanhui Fan

Recently, integrated optics has gained interest as a hardware platform for implementing machine learning algorithms.

BIG-bench Machine Learning

AspEm: Embedding Learning by Aspects in Heterogeneous Information Networks

no code implementations5 Mar 2018 Yu Shi, Huan Gui, Qi Zhu, Lance Kaplan, Jiawei Han

Therefore, we are motivated to propose a novel embedding learning framework---AspEm---to preserve the semantic information in HINs based on multiple aspects.

Link Prediction Network Embedding

Gradient Boosting With Piece-Wise Linear Regression Trees

1 code implementation15 Feb 2018 Yu Shi, Jian Li, Zhize Li

We show that PL Trees can accelerate convergence of GBDT and improve the accuracy.

Ensemble Learning regression

mvn2vec: Preservation and Collaboration in Multi-View Network Embedding

1 code implementation19 Jan 2018 Yu Shi, Fangqiu Han, Xinwei He, Xinran He, Carl Yang, Jie Luo, Jiawei Han

With experiments on a series of synthetic datasets, a large-scale internal Snapchat dataset, and two public datasets, we confirm the validity and importance of preservation and collaboration as two objectives for multi-view network embedding.

Network Embedding

PReP: Path-Based Relevance from a Probabilistic Perspective in Heterogeneous Information Networks

no code implementations5 Jun 2017 Yu Shi, Po-Wei Chan, Honglei Zhuang, Huan Gui, Jiawei Han

We also identify, from real-world data, and propose to model cross-meta-path synergy, which is a characteristic important for defining path-based HIN relevance and has not been modeled by existing methods.

Cannot find the paper you are looking for? You can Submit a new open access paper.