Search Results for author: Jiarui Lu

Found 20 papers, 11 papers with code

High Dimensional M-Estimation with Missing Outcomes: A Semi-Parametric Framework

no code implementations 26 Nov 2019 Abhishek Chakrabortty, Jiarui Lu, T. Tony Cai, Hongzhe Li

Under mild tail assumptions and arbitrarily chosen (working) models for the propensity score (PS) and the outcome regression (OR) estimators, satisfying only some high-level conditions, we establish finite sample performance bounds for the DDR estimator showing its (optimal) $L_2$ error rate to be $\sqrt{s (\log d)/ n}$ when both models are correct, and its consistency and DR properties when only one of them is correct.

Causal Inference regression +1

CREAD: Combined Resolution of Ellipses and Anaphora in Dialogues

1 code implementation NAACL 2021 Bo-Hsiang Tseng, Shruti Bhargava, Jiarui Lu, Joel Ruben Antony Moniz, Dhivya Piraviperumal, Lin Li, Hong Yu

In this work, we propose a novel joint learning framework of modeling coreference resolution and query rewriting for complex, multi-turn dialogue understanding.

coreference-resolution Dialogue Understanding

TorchDrug: A Powerful and Flexible Machine Learning Platform for Drug Discovery

1 code implementation 16 Feb 2022 Zhaocheng Zhu, Chence Shi, Zuobai Zhang, Shengchao Liu, Minghao Xu, Xinyu Yuan, Yangtian Zhang, Junkun Chen, Huiyu Cai, Jiarui Lu, Chang Ma, Runcheng Liu, Louis-Pascal Xhonneux, Meng Qu, Jian Tang

However, the lack of domain knowledge (e.g., which tasks to work on), standard benchmarks, and data preprocessing pipelines is the main obstacle for machine learning researchers working in this domain.

BIG-bench Machine Learning Drug Discovery +2

PEER: A Comprehensive and Multi-Task Benchmark for Protein Sequence Understanding

1 code implementation 5 Jun 2022 Minghao Xu, Zuobai Zhang, Jiarui Lu, Zhaocheng Zhu, Yangtian Zhang, Chang Ma, Runcheng Liu, Jian Tang

However, there is a lack of a standard benchmark to evaluate the performance of different methods, which hinders the progress of deep learning in this field.

Feature Engineering Multi-Task Learning +2

Multi-modal Molecule Structure-text Model for Text-based Retrieval and Editing

1 code implementation 21 Dec 2022 Shengchao Liu, Weili Nie, Chengpeng Wang, Jiarui Lu, Zhuoran Qiao, Ling Liu, Jian Tang, Chaowei Xiao, Anima Anandkumar

Here we present a multi-modal molecule structure-text model, MoleculeSTM, by jointly learning molecules' chemical structures and textual descriptions via a contrastive learning strategy.

Contrastive Learning Drug Discovery +2

Str2Str: A Score-based Framework for Zero-shot Protein Conformation Sampling

1 code implementation 5 Jun 2023 Jiarui Lu, Bozitao Zhong, Zuobai Zhang, Jian Tang

The dynamic nature of proteins is crucial for determining their biological functions and properties, and Monte Carlo (MC) and molecular dynamics (MD) simulations are the predominant tools for studying it.

Benchmarking Denoising +1

Probing the Multi-turn Planning Capabilities of LLMs via 20 Question Games

1 code implementation 2 Oct 2023 Yizhe Zhang, Jiarui Lu, Navdeep Jaitly

In this paper, we offer a surrogate problem which assesses an LLM's capability to deduce an entity unknown to itself, but revealed to a judge, by asking the judge a series of queries.

STEER: Semantic Turn Extension-Expansion Recognition for Voice Assistants

no code implementations 25 Oct 2023 Leon Liyang Zhang, Jiarui Lu, Joel Ruben Antony Moniz, Aditya Kulkarni, Dhivya Piraviperumal, Tien Dung Tran, Nicholas Tzou, Hong Yu

In the context of a voice assistant system, steering refers to the phenomenon in which a user issues a follow-up command attempting to direct or clarify a previous turn.

Sentence

Can Large Language Models Understand Context?

no code implementations 1 Feb 2024 Yilun Zhu, Joel Ruben Antony Moniz, Shruti Bhargava, Jiarui Lu, Dhivya Piraviperumal, Site Li, Yuan Zhang, Hong Yu, Bo-Hsiang Tseng

Understanding context is key to understanding human language, an ability which Large Language Models (LLMs) have been increasingly seen to demonstrate to an impressive extent.

In-Context Learning Quantization

Structure-Informed Protein Language Model

1 code implementation 7 Feb 2024 Zuobai Zhang, Jiarui Lu, Vijil Chenthamarakshan, Aurélie Lozano, Payel Das, Jian Tang

To address this issue, we introduce the integration of remote homology detection to distill structural information into protein language models without requiring explicit protein structures as input.

Protein Function Prediction Protein Language Model

Fusing Neural and Physical: Augment Protein Conformation Sampling with Tractable Simulations

no code implementations 16 Feb 2024 Jiarui Lu, Zuobai Zhang, Bozitao Zhong, Chence Shi, Jian Tang

Protein dynamics are common and important for the biological functions and properties of proteins, and studying them usually involves time-consuming molecular dynamics (MD) simulations in silico.

Physical Simulations
