Search Results for author: Bo Zheng

Found 158 papers, 53 papers with code

GeoSense: Evaluating Identification and Application of Geometric Principles in Multimodal Reasoning

no code implementations17 Apr 2025 Liangyu Xu, Yingxiu Zhao, Jingyun Wang, Yingyao Wang, Bu Pi, Chen Wang, Mingliang Zhang, Jihao Gu, Xiang Li, Xiaoyong Zhu, Jun Song, Bo Zheng

Geometry problem-solving (GPS), a challenging task requiring both visual comprehension and symbolic reasoning, effectively measures the reasoning capabilities of multimodal large language models (MLLMs).

DMM: Building a Versatile Image Generation Model via Distillation-Based Model Merging

no code implementations16 Apr 2025 Tianhui Song, Weixin Feng, Shuai Wang, Xubin Li, Tiezheng Ge, Bo Zheng, LiMin Wang

The success of text-to-image (T2I) generation models has spurred a proliferation of numerous model checkpoints fine-tuned from the same base model on various specialized datasets.

ECKGBench: Benchmarking Large Language Models in E-commerce Leveraging Knowledge Graph

no code implementations20 Mar 2025 Langming Liu, Haibin Chen, Yuhao Wang, Yujin Yuan, Shilei Liu, Wenbo Su, Xiangyu Zhao, Bo Zheng

To bridge the evaluation gap, we propose ECKGBench, a dataset specifically designed to evaluate the capacities of LLMs in e-commerce knowledge.

Benchmarking Hallucination +1

Deconstructing Long Chain-of-Thought: A Structured Reasoning Optimization Framework for Long CoT Distillation

1 code implementation20 Mar 2025 Yijia Luo, Yulin Song, Xingyao Zhang, Jiaheng Liu, Weixun Wang, Gengru Chen, Wenbo Su, Bo Zheng

Recent advancements in large language models (LLMs) have demonstrated remarkable reasoning capabilities through long chain-of-thought (CoT) reasoning.

KDSelector: A Knowledge-Enhanced and Data-Efficient Model Selector Learning Framework for Time Series Anomaly Detection

1 code implementation16 Mar 2025 Zhiyu Liang, Dongrui Cai, Chenyuan Zhang, Zheng Liang, Chen Liang, Bo Zheng, Shi Qiu, Jin Wang, Hongzhi Wang

Model selection has been raised as an essential problem in the area of time series anomaly detection (TSAD), because there is no single best TSAD model for the highly heterogeneous time series in real-world applications.

Anomaly Detection Model Selection +2

D3: Diversity, Difficulty, and Dependability-Aware Data Selection for Sample-Efficient LLM Instruction Tuning

no code implementations14 Mar 2025 Jia Zhang, Chen-Xi Zhang, Yao Liu, Yi-Xuan Jin, Xiao-Wen Yang, Bo Zheng, Yi Liu, Lan-Zhe Guo

In this paper, we first establish data selection criteria based on three distinct aspects of data value: diversity, difficulty, and dependability, and then propose the D3 method comprising two key steps of scoring and selection.

Diversity Instruction Following

Nash Equilibrium Constrained Auto-bidding With Bi-level Reinforcement Learning

no code implementations13 Mar 2025 Zhiyu Mou, Miao Xu, Rongquan Bai, Zhuoran Yang, Chuan Yu, Jian Xu, Bo Zheng

However, the NCB problem presents significant challenges due to its constrained bi-level structure and the typically large number of advertisers involved.

Gradient Deconfliction via Orthogonal Projections onto Subspaces For Multi-task Learning

no code implementations5 Mar 2025 Shijie Zhu, Hui Zhao, Tianshu Wu, Pengjie Wang, Hongbo Deng, Jian Xu, Bo Zheng

Although multi-task learning (MTL) has been a preferred approach and successfully applied in many real-world scenarios, MTL models are not guaranteed to outperform single-task models on all tasks mainly due to the negative effects of conflicting gradients among the tasks.

Multi-Task Learning

ChineseEcomQA: A Scalable E-commerce Concept Evaluation Benchmark for Large Language Models

1 code implementation27 Feb 2025 Haibin Chen, Kangtao Lv, Chengwei Hu, Yanshi Li, Yujin Yuan, Yancheng He, Xingyao Zhang, Langming Liu, Shilei Liu, Wenbo Su, Bo Zheng

To address these problems, we propose \textbf{ChineseEcomQA}, a scalable question-answering benchmark focused on fundamental e-commerce concepts.

Question Answering RAG +1

Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning?

1 code implementation26 Feb 2025 Yancheng He, Shilong Li, Jiaheng Liu, Weixun Wang, Xingyuan Bu, Ge Zhang, Zhongyuan Peng, Zhaoxiang Zhang, Zhicheng Zheng, Wenbo Su, Bo Zheng

In this paper, to understand the qualities of these long CoTs and measure the critique abilities of existing LLMs on these long CoTs, we introduce the DeltaBench, including the generated long CoTs from different o1-like models (e. g., QwQ, DeepSeek-R1) for different reasoning tasks (e. g., Math, Code, General Reasoning), to measure the ability to detect errors in long CoT reasoning.

Math

UQABench: Evaluating User Embedding for Prompting LLMs in Personalized Question Answering

1 code implementation26 Feb 2025 Langming Liu, Shilei Liu, Yujin Yuan, Yizhen Zhang, Bencheng Yan, Zhiyuan Zeng, ZiHao Wang, Jiaqi Liu, Di Wang, Wenbo Su, Pengjie Wang, Jian Xu, Bo Zheng

To address this concern, we propose \name, a benchmark designed to evaluate the effectiveness of user embeddings in prompting LLMs for personalization.

Question Answering

AIR: Complex Instruction Generation via Automatic Iterative Refinement

1 code implementation25 Feb 2025 Wei Liu, Yancheng He, Hui Huang, Chengwei Hu, Jiaheng Liu, Shilong Li, Wenbo Su, Bo Zheng

With the development of large language models, their ability to follow simple instructions has significantly improved.

VALUE: Value-Aware Large Language Model for Query Rewriting via Weighted Trie in Sponsored Search

no code implementations25 Feb 2025 Boyang Zuo, Xiao Zhang, Feng Li, Pengjie Wang, Jian Xu, Bo Zheng

In the realm of sponsored search advertising, matching advertisements with the search intent of a user's query is crucial.

Attribute Language Modeling +2

HiddenDetect: Detecting Jailbreak Attacks against Large Vision-Language Models via Monitoring Hidden States

1 code implementation20 Feb 2025 Yilei Jiang, Xinyan Gao, Tianshuo Peng, Yingshui Tan, Xiaoyong Zhu, Bo Zheng, Xiangyu Yue

The integration of additional modalities increases the susceptibility of large vision-language models (LVLMs) to safety risks, such as jailbreak attacks, compared to their language-only counterparts.

"See the World, Discover Knowledge": A Chinese Factuality Evaluation for Large Vision Language Models

no code implementations17 Feb 2025 Jihao Gu, Yingyao Wang, Pi Bu, Chen Wang, ZiMing Wang, Tengtao Song, Donglai Wei, Jiale Yuan, Yingxiu Zhao, Yancheng He, Shilong Li, Jiaheng Liu, Meng Cao, Jun Song, Yingshui Tan, Xiang Li, Wenbo Su, Zhicheng Zheng, Xiaoyong Zhu, Bo Zheng

The evaluation of factual accuracy in large vision language models (LVLMs) has lagged behind their rapid development, making it challenging to fully reflect these models' knowledge capacity and reliability.

Object Recognition Question Answering +1

Equilibrate RLHF: Towards Balancing Helpfulness-Safety Trade-off in Large Language Models

no code implementations17 Feb 2025 Yingshui Tan, Yilei Jiang, Yanshi Li, Jiaheng Liu, Xingyuan Bu, Wenbo Su, Xiangyu Yue, Xiaoyong Zhu, Bo Zheng

Fine-tuning large language models (LLMs) based on human preferences, commonly achieved through reinforcement learning from human feedback (RLHF), has been effective in improving their performance.

Safety Alignment

Large Language Models Are Universal Recommendation Learners

no code implementations5 Feb 2025 Junguang Jiang, Yanwen Huang, Bin Liu, Xiaoyu Kong, Ziru Xu, Han Zhu, Jian Xu, Bo Zheng

In real-world recommender systems, different tasks are typically addressed using supervised learning on task-specific datasets with carefully designed model architectures.

Prompt Engineering Recommendation Systems

MIM: Multi-modal Content Interest Modeling Paradigm for User Behavior Modeling

no code implementations1 Feb 2025 Bencheng Yan, Si Chen, Shichang Jia, Jianyu Liu, Yueran Liu, Chenghan Fu, Wanxian Guan, Hui Zhao, Xiang Zhang, Kai Zhang, Wenbo Su, Pengjie Wang, Jian Xu, Bo Zheng, Baolin Liu

Click-Through Rate (CTR) prediction is a crucial task in recommendation systems, online searches, and advertising platforms, where accurately capturing users' real interests in content is essential for performance.

Click-Through Rate Prediction Collaborative Filtering +1

An Adaptable Budget Planner for Enhancing Budget-Constrained Auto-Bidding in Online Advertising

1 code implementation26 Jan 2025 Zhijian Duan, Yusen Huo, Tianyu Wang, Zhilin Zhang, Yeshu Li, Chuan Yu, Jian Xu, Bo Zheng, Xiaotie Deng

Extensive simulation experiments and real-world A/B testing validate the effectiveness of ABPlanner, demonstrating its capability to enhance the cumulative value achieved by auto-bidders.

Sequential Decision Making

DAGPrompT: Pushing the Limits of Graph Prompting with a Distribution-aware Graph Prompt Tuning Approach

1 code implementation25 Jan 2025 Qin Chen, Liang Wang, Bo Zheng, Guojie Song

This paper identifies two key challenges in adapting graph prompting methods for complex graphs: (1) adapting the model to new distributions in downstream tasks to mitigate pre-training and fine-tuning discrepancies from heterophily and (2) customizing prompts for hop-specific node requirements.

General Knowledge Graph Classification

Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models

no code implementations21 Jan 2025 Zihan Qiu, Zeyu Huang, Bo Zheng, Kaiyue Wen, Zekun Wang, Rui Men, Ivan Titov, Dayiheng Liu, Jingren Zhou, Junyang Lin

Existing MoE training frameworks usually employ the parallel training strategy so that $f_i$ and the LBL are calculated within a $\textbf{micro-batch}$ and then averaged across parallel groups.

Compression with Global Guidance: Towards Training-free High-Resolution MLLMs Acceleration

1 code implementation9 Jan 2025 Xuyang Liu, ZiMing Wang, Yuhang Han, Yingyao Wang, Jiale Yuan, Jun Song, Bo Zheng, Linfeng Zhang, Siteng Huang, Honggang Chen

Multimodal large language models (MLLMs) have attracted considerable attention due to their exceptional performance in visual content understanding and reasoning.

ProgCo: Program Helps Self-Correction of Large Language Models

no code implementations2 Jan 2025 Xiaoshuai Song, Yanan Wu, Weixun Wang, Jiaheng Liu, Wenbo Su, Bo Zheng

Then, program-driven refinement (ProgRe) receives feedback from ProgVe, conducts dual reflection and refinement on both responses and verification programs to mitigate misleading of incorrect feedback in complex reasoning tasks.

Instruction Following

LongDocURL: a Comprehensive Multimodal Long Document Benchmark Integrating Understanding, Reasoning, and Locating

1 code implementation24 Dec 2024 Chao Deng, Jiale Yuan, Pi Bu, Peijie Wang, Zhong-Zhi Li, Jian Xu, Xiao-Hui Li, Yuan Gao, Jun Song, Bo Zheng, Cheng-Lin Liu

Large vision language models (LVLMs) have improved the document understanding capabilities remarkably, enabling the handling of complex document elements, longer contexts, and a wider range of tasks.

document understanding Question Answering

Align Anything: Training All-Modality Models to Follow Instructions with Language Feedback

1 code implementation20 Dec 2024 Jiaming Ji, Jiayi Zhou, Hantao Lou, Boyuan Chen, Donghai Hong, Xuyao Wang, Wenqi Chen, Kaile Wang, Rui Pan, Jiahao Li, Mohan Wang, Josef Dai, Tianyi Qiu, Hua Xu, Dong Li, WeiPeng Chen, Jun Song, Bo Zheng, Yaodong Yang

In this work, we make the first attempt to fine-tune all-modality models (i. e. input and output with any modality, also named any-to-any models) using human preference data across all modalities (including text, image, audio, and video), ensuring its behavior aligns with human intentions.

All Instruction Following

Token Preference Optimization with Self-Calibrated Visual-Anchored Rewards for Hallucination Mitigation

no code implementations19 Dec 2024 Jihao Gu, Yingyao Wang, Meng Cao, Pi Bu, Jun Song, Yancheng He, Shilong Li, Bo Zheng

Direct Preference Optimization (DPO) has been demonstrated to be highly effective in mitigating hallucinations in Large Vision Language Models (LVLMs) by aligning their outputs more closely with human preferences.

Hallucination

LLaVA-UHD v2: an MLLM Integrating High-Resolution Feature Pyramid via Hierarchical Window Transformer

1 code implementation18 Dec 2024 YiPeng Zhang, Yifan Liu, Zonghao Guo, Yidan Zhang, Xuesong Yang, Chi Chen, Jun Song, Bo Zheng, Yuan YAO, Zhiyuan Liu, Tat-Seng Chua, Maosong Sun

To address this issue, we present LLaVA-UHD v2, an advanced MLLM centered around a Hierarchical window transformer that enables capturing diverse visual granularity by constructing and integrating a high-resolution feature pyramid.

Attribute Text Generation

AuctionNet: A Novel Benchmark for Decision-Making in Large-Scale Games

1 code implementation14 Dec 2024 Kefan Su, Yusen Huo, Zhilin Zhang, Shuai Dou, Chuan Yu, Jian Xu, Zongqing Lu, Bo Zheng

We believe that AuctionNet is applicable not only to research on bid decision-making in ad auctions but also to the general area of decision-making in large-scale games.

Decision Making

HyViLM: Enhancing Fine-Grained Recognition with a Hybrid Encoder for Vision-Language Models

no code implementations11 Dec 2024 Shiding Zhu, Wenhui Dong, Jun Song, Yingbo Wang, Yanan Guo, Bo Zheng

A common approach currently involves dynamically cropping the original high-resolution image into smaller sub-images, which are then fed into a vision encoder that was pre-trained on lower-resolution images.

TextVQA

WiS Platform: Enhancing Evaluation of LLM-Based Multi-Agent Systems Through Game-Based Analysis

no code implementations4 Dec 2024 Chengwei Hu, Jianhui Zheng, Yancheng He, Hangyu Guo, Junguang Jiang, Han Zhu, Kai Sun, Yuning Jiang, Wenbo Su, Bo Zheng

In this paper, to facilitate the research on LLM-based MAS, we introduce an open, scalable, and real-time updated platform for accessing and analyzing the LLM-based MAS based on the games Who is Spy?"

FocusLLaVA: A Coarse-to-Fine Approach for Efficient and Effective Visual Token Compression

no code implementations21 Nov 2024 Yuke Zhu, Chi Xie, Shuang Liang, Bo Zheng, Sheng Guo

Recent advances on Multi-modal Large Language Models have demonstrated that high-resolution image input is crucial for model capabilities, especially for fine-grained tasks.

Visual Question Answering

Explainable LLM-driven Multi-dimensional Distillation for E-Commerce Relevance Learning

no code implementations20 Nov 2024 Gang Zhao, XiMing Zhang, Chenji Lu, Hui Zhao, Tianshu Wu, Pengjie Wang, Jian Xu, Bo Zheng

Effective query-item relevance modeling is pivotal for enhancing user experience and safeguarding user satisfaction in e-commerce search systems.

Knowledge Distillation Large Language Model

PSA-VLM: Enhancing Vision-Language Model Safety through Progressive Concept-Bottleneck-Driven Alignment

no code implementations18 Nov 2024 Zhendong Liu, Yuanbi Nie, Yingshui Tan, Jiaheng Liu, Xiangyu Yue, Qiushi Cui, Chongjun Wang, Xiaoyong Zhu, Bo Zheng

However, recent research shows that the visual modality in VLMs is highly vulnerable, allowing attackers to bypass safety alignment in LLMs through visually transmitted content, launching harmful attacks.

Language Modeling Language Modelling +1

FlowDCN: Exploring DCN-like Architectures for Fast Image Generation with Arbitrary Resolution

no code implementations30 Oct 2024 Shuai Wang, Zexian Li, Tianhui Song, Xubin Li, Tiezheng Ge, Bo Zheng, LiMin Wang

Arbitrary-resolution image generation still remains a challenging task in AIGC, as it requires handling varying resolutions and aspect ratios while maintaining high visual quality.

Image Generation

M2rc-Eval: Massively Multilingual Repository-level Code Completion Evaluation

no code implementations28 Oct 2024 Jiaheng Liu, Ken Deng, Congnan Liu, Jian Yang, Shukai Liu, He Zhu, Peng Zhao, Linzheng Chai, Yanan Wu, Ke Jin, Ge Zhang, Zekun Wang, Guoan Zhang, Bangyu Xiang, Wenbo Su, Bo Zheng

Besides, the existing benchmarks usually report overall average scores of different languages, where the fine-grained abilities in different completion scenarios are ignored.

Code Completion

2D-DPO: Scaling Direct Preference Optimization with 2-Dimensional Supervision

no code implementations25 Oct 2024 Shilong Li, Yancheng He, Hui Huang, Xingyuan Bu, Jiaheng Liu, Hangyu Guo, Weixun Wang, Jihao Gu, Wenbo Su, Bo Zheng

Recent advancements in Direct Preference Optimization (DPO) have significantly enhanced the alignment of Large Language Models (LLMs) with human preferences, owing to its simplicity and effectiveness.

Adaptive Dense Reward: Understanding the Gap Between Action and Reward Space in Alignment

no code implementations23 Oct 2024 Yanshi Li, Shaopan Xiong, Gengru Chen, Xiaoyang Li, Yijia Luo, Xingyao Zhang, Yanhui Huang, Xingyuan Bu, Yingshui Tan, Chun Yuan, Jiamang Wang, Wenbo Su, Bo Zheng

Our method improves the success rate on adversarial samples by 10\% compared to the sample-wise approach, and achieves a 1. 3\% improvement on evaluation benchmarks such as MMLU, GSM8K, HumanEval, etc.

GSM8K HumanEval +1

MTU-Bench: A Multi-granularity Tool-Use Benchmark for Large Language Models

1 code implementation15 Oct 2024 Pei Wang, Yanan Wu, Zekun Wang, Jiaheng Liu, Xiaoshuai Song, Zhongyuan Peng, Ken Deng, Chenchen Zhang, Jiakai Wang, Junran Peng, Ge Zhang, Hangyu Guo, Zhaoxiang Zhang, Wenbo Su, Bo Zheng

Besides, all evaluation metrics of our MTU-Bench are based on the prediction results and the ground truth without using any GPT or human evaluation metrics.

Can VLMs Play Action Role-Playing Games? Take Black Myth Wukong as a Study Case

no code implementations19 Sep 2024 Peng Chen, Pi Bu, Jun Song, Yuan Gao, Bo Zheng

We define 12 tasks within the game, with 75% focusing on combat, and incorporate several state-of-the-art VLMs into this benchmark.

Large Language Model

Enhancing Taobao Display Advertising with Multimodal Representations: Challenges, Approaches and Insights

no code implementations28 Jul 2024 Xiang-Rong Sheng, Feifan Yang, Litong Gong, Biao Wang, Zhangming Chan, Yujing Zhang, Yueyao Cheng, Yong-Nan Zhu, Tiezheng Ge, Han Zhu, Yuning Jiang, Jian Xu, Bo Zheng

Despite the recognized potential of multimodal data to improve model accuracy, many large-scale industrial recommendation systems, including Taobao display advertising system, predominantly depend on sparse ID features in their models.

Recommendation Systems Semantic Similarity +1

DDK: Distilling Domain Knowledge for Efficient Large Language Models

no code implementations23 Jul 2024 Jiaheng Liu, Chenchen Zhang, Jinyang Guo, Yuanxing Zhang, Haoran Que, Ken Deng, Zhiqi Bai, Jie Liu, Ge Zhang, Jiakai Wang, Yanan Wu, Congnan Liu, Wenbo Su, Jiamang Wang, Lin Qu, Bo Zheng

Despite the advanced intelligence abilities of large language models (LLMs) in various applications, they still face significant computational and storage demands.

Knowledge Distillation

Deep Bag-of-Words Model: An Efficient and Interpretable Relevance Architecture for Chinese E-Commerce

no code implementations12 Jul 2024 Zhe Lin, Jiwei Tan, Dan Ou, Xi Chen, Shaowei Yao, Bo Zheng

Text relevance or text matching of query and product is an essential technique for the e-commerce search system to ensure that the displayed products can match the intent of the query.

Computational Efficiency Language Modelling +1

GeoGPT4V: Towards Geometric Multi-modal Large Language Models with Geometric Image Generation

1 code implementation17 Jun 2024 Shihao Cai, Keqin Bao, Hangyu Guo, Jizhi Zhang, Jun Song, Bo Zheng

To overcome this issue, we introduce a novel pipeline that leverages GPT-4 and GPT-4V to generate relatively basic geometry problems with aligned text and images, facilitating model learning.

Image Generation Math

D-CPT Law: Domain-specific Continual Pre-Training Scaling Law for Large Language Models

no code implementations3 Jun 2024 Haoran Que, Jiaheng Liu, Ge Zhang, Chenchen Zhang, Xingwei Qu, Yinghao Ma, Feiyu Duan, Zhiqi Bai, Jiakai Wang, Yuanxing Zhang, Xu Tan, Jie Fu, Wenbo Su, Jiamang Wang, Lin Qu, Bo Zheng

To address the limitations of existing methods, inspired by the Scaling Law for performance prediction, we propose to investigate the Scaling Law of the Domain-specific Continual Pre-Training (D-CPT Law) to decide the optimal mixture ratio with acceptable training costs for LLMs of different sizes.

Math

Demystify Mamba in Vision: A Linear Attention Perspective

1 code implementation26 May 2024 Dongchen Han, Ziyi Wang, Zhuofan Xia, Yizeng Han, Yifan Pu, Chunjiang Ge, Jun Song, Shiji Song, Bo Zheng, Gao Huang

By exploring the similarities and disparities between the effective Mamba and subpar linear attention Transformer, we provide comprehensive analyses to demystify the key factors behind Mamba's success.

Image Classification Mamba

AIGB: Generative Auto-bidding via Conditional Diffusion Modeling

no code implementations25 May 2024 Jiayan Guo, Yusen Huo, Zhilin Zhang, Tianyu Wang, Chuan Yu, Jian Xu, Yan Zhang, Bo Zheng

Auto-bidding plays a crucial role in facilitating online advertising by automatically providing bids for advertisers.

Reinforcement Learning (RL)

Safety Alignment for Vision Language Models

no code implementations22 May 2024 Zhendong Liu, Yuanbi Nie, Yingshui Tan, Xiangyu Yue, Qiushi Cui, Chongjun Wang, Xiaoyong Zhu, Bo Zheng

To address this issue, we enhance the existing VLMs' visual modality safety alignment by adding safety modules, including a safety projector, safety tokens, and a safety head, through a two-stage training process, effectively improving the model's defense against risky images.

Red Teaming Safety Alignment

Enhancing Prompt Following with Visual Control Through Training-Free Mask-Guided Diffusion

no code implementations23 Apr 2024 Hongyu Chen, Yiqi Gao, Min Zhou, Peng Wang, Xubin Li, Tiezheng Ge, Bo Zheng

Meanwhile, a network, dubbed as Masked ControlNet, is designed to utilize these object masks for object generation in the misaligned visual control region.

Attribute Object

Accelerating Image Generation with Sub-path Linear Approximation Model

no code implementations22 Apr 2024 Chen Xu, Tianhui Song, Weixin Feng, Xubin Li, Tiezheng Ge, Bo Zheng, LiMin Wang

Diffusion models have significantly advanced the state of the art in image, audio, and video generation tasks.

Denoising Image Generation +1

RHanDS: Refining Malformed Hands for Generated Images with Decoupled Structure and Style Guidance

no code implementations22 Apr 2024 Chengrui Wang, PengFei Liu, Min Zhou, Ming Zeng, Xubin Li, Tiezheng Ge, Bo Zheng

The style guidance is a hand image, e. g., the malformed hand itself, and is employed to furnish the style reference for hand refining.

MetaSplit: Meta-Split Network for Limited-Stock Product Recommendation

no code implementations11 Mar 2024 Wenhao Wu, Jialiang Zhou, Ailong He, Shuguang Han, Jufeng Chen, Bo Zheng

Due to limited user interactions for each product (i. e. item), the corresponding item embedding in the CTR model may not easily converge.

Click-Through Rate Prediction Meta-Learning +1

Tuning-Free Noise Rectification for High Fidelity Image-to-Video Generation

no code implementations5 Mar 2024 Weijie Li, Litong Gong, Yiran Zhu, Fanda Fan, Biao Wang, Tiezheng Ge, Bo Zheng

The experimental results demonstrate the effectiveness of our approach in improving the fidelity of generated videos.

Denoising Image Animation +1

MEBS: Multi-task End-to-end Bid Shading for Multi-slot Display Advertising

no code implementations5 Mar 2024 Zhen Gong, Lvyin Niu, Yang Zhao, Miao Xu, Zhenzhe Zheng, Haoqi Zhang, Zhilin Zhang, Fan Wu, Rongquan Bai, Chuan Yu, Jian Xu, Bo Zheng

Through extensive offline and online experiments, we demonstrate the effectiveness and efficiency of our method, and we obtain a 7. 01% lift in Gross Merchandise Volume, a 7. 42% lift in Return on Investment, and a 3. 26% lift in ad buy count.

AtomoVideo: High Fidelity Image-to-Video Generation

no code implementations4 Mar 2024 Litong Gong, Yiran Zhu, Weijie Li, Xiaoyang Kang, Biao Wang, Tiezheng Ge, Bo Zheng

Recently, video generation has achieved significant rapid development based on superior text-to-image generation techniques.

Image to Video Generation Text-to-Image Generation

Making Pre-trained Language Models Great on Tabular Prediction

1 code implementation4 Mar 2024 Jiahuan Yan, Bo Zheng, Hongxia Xu, Yiheng Zhu, Danny Z. Chen, Jimeng Sun, Jian Wu, Jintai Chen

Condensing knowledge from diverse domains, language models (LMs) possess the capability to comprehend feature names from various tables, potentially serving as versatile learners in transferring knowledge across distinct tables and diverse prediction tasks, but their discrete text representation space is inherently incompatible with numerical feature values in tables.

Prediction

SERVAL: Synergy Learning between Vertical Models and LLMs towards Oracle-Level Zero-shot Medical Prediction

no code implementations3 Mar 2024 Jiahuan Yan, Jintai Chen, Chaowen Hu, Bo Zheng, Yaojun Hu, Jimeng Sun, Jian Wu

Recent development of large language models (LLMs) has exhibited impressive zero-shot proficiency on generic and common sense questions.

Common Sense Reasoning Diagnostic

MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues

1 code implementation22 Feb 2024 Ge Bai, Jie Liu, Xingyuan Bu, Yancheng He, Jiaheng Liu, Zhanhui Zhou, Zhuoran Lin, Wenbo Su, Tiezheng Ge, Bo Zheng, Wanli Ouyang

By conducting a detailed analysis of real multi-turn dialogue data, we construct a three-tier hierarchical ability taxonomy comprising 4208 turns across 1388 multi-turn dialogues in 13 distinct tasks.

ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large Language Models

1 code implementation22 Feb 2024 Yanan Wu, Jie Liu, Xingyuan Bu, Jiaheng Liu, Zhanhui Zhou, Yuanxing Zhang, Chenchen Zhang, Zhiqi Bai, Haibin Chen, Tiezheng Ge, Wanli Ouyang, Wenbo Su, Bo Zheng

This paper introduces ConceptMath, a bilingual (English and Chinese), fine-grained benchmark that evaluates concept-wise mathematical reasoning of Large Language Models (LLMs).

Math Mathematical Reasoning

Automated Deterministic Auction Design with Objective Decomposition

no code implementations19 Feb 2024 Zhijian Duan, Haoran Sun, Yichong Xia, Siqiang Wang, Zhilin Zhang, Chuan Yu, Jian Xu, Bo Zheng, Xiaotie Deng

Identifying high-revenue mechanisms that are both dominant strategy incentive compatible (DSIC) and individually rational (IR) is a fundamental challenge in auction design.

E^2-LLM: Efficient and Extreme Length Extension of Large Language Models

no code implementations13 Jan 2024 Jiaheng Liu, Zhiqi Bai, Yuanxing Zhang, Chenchen Zhang, Yu Zhang, Ge Zhang, Jiakai Wang, Haoran Que, Yukang Chen, Wenbo Su, Tiezheng Ge, Jie Fu, Wenhu Chen, Bo Zheng

Typically, training LLMs with long context sizes is computationally expensive, requiring extensive training hours and GPU resources.

4k Position

Calibration-compatible Listwise Distillation of Privileged Features for CTR Prediction

no code implementations14 Dec 2023 Xiaoqiang Gui, Yueyao Cheng, Xiang-Rong Sheng, Yunfeng Zhao, Guoxian Yu, Shuguang Han, Yuning Jiang, Jian Xu, Bo Zheng

A typical practice is privileged features distillation (PFD): train a teacher model using all features (including privileged ones) and then distill the knowledge from the teacher model using a student model (excluding the privileged features), which is then employed for online serving.

Click-Through Rate Prediction

Combating Bilateral Edge Noise for Robust Link Prediction

1 code implementation NeurIPS 2023 Zhanke Zhou, Jiangchao Yao, Jiaxu Liu, Xiawei Guo, Quanming Yao, Li He, Liang Wang, Bo Zheng, Bo Han

To address this dilemma, we propose an information-theory-guided principle, Robust Graph Information Bottleneck (RGIB), to extract reliable supervision signals and avoid representation collapse.

Denoising Link Prediction +2

Robust Representation Learning for Unified Online Top-K Recommendation

no code implementations24 Oct 2023 Minfang Lu, Yuchen Jiang, Huihui Dong, Qi Li, Ziru Xu, Yuanlin Liu, Lixia Wu, Haoyuan Hu, Han Zhu, Yuning Jiang, Jian Xu, Bo Zheng

The robust representation learning employs domain adversarial learning and multi-view wasserstein distribution learning to learn robust representations.

Fairness Representation Learning

OneSeg: Self-learning and One-shot Learning based Single-slice Annotation for 3D Medical Image Segmentation

no code implementations24 Sep 2023 Yixuan Wu, Bo Zheng, Jintai Chen, Danny Z. Chen, Jian Wu

As deep learning methods continue to improve medical image segmentation performance, data annotation is still a big bottleneck due to the labor-intensive and time-consuming burden on medical experts, especially for 3D images.

Image Segmentation Medical Image Segmentation +5

Neuro-Symbolic Recommendation Model based on Logic Query

no code implementations14 Sep 2023 Maonian Wu, Bang Chen, Shaojun Zhu, Bo Zheng, Wei Peng, Mingyi Zhang

A recommendation system assists users in finding items that are relevant to them.

Entire Space Cascade Delayed Feedback Modeling for Effective Conversion Rate Prediction

no code implementations9 Aug 2023 Yunfeng Zhao, Xu Yan, Xiaoqiang Gui, Shuguang Han, Xiang-Rong Sheng, Guoxian Yu, Jufeng Chen, Zhao Xu, Bo Zheng

Furthermore, there is delayed feedback in both conversion and refund events and they are sequentially dependent, named cascade delayed feedback (CDF), which significantly harms data freshness for model training.

Prediction Recommendation Systems +1

Neural-Symbolic Recommendation with Graph-Enhanced Information

1 code implementation11 Jul 2023 Bang Chen, Wei Peng, Maonian Wu, Bo Zheng, Shaojun Zhu

Some researchers use user behavior for logic reasoning to achieve recommendation prediction from the perspective of cognitive reasoning, but this kind of reasoning is a local one and ignores implicit information on a global scale.

Recommendation Systems

Multi-Scenario Ranking with Adaptive Feature Learning

no code implementations29 Jun 2023 Yu Tian, Bofang Li, Si Chen, Xubin Li, Hongbo Deng, Jian Xu, Bo Zheng, Qian Wang, Chenliang Li

Recently, Multi-Scenario Learning (MSL) is widely used in recommendation and retrieval systems in the industry because it facilitates transfer learning from different scenarios, mitigating data sparsity and reducing maintenance cost.

Retrieval Transfer Learning

Adversarial Constrained Bidding via Minimax Regret Optimization with Causality-Aware Reinforcement Learning

no code implementations12 Jun 2023 Haozhe Wang, Chao Du, Panyan Fang, Li He, Liang Wang, Bo Zheng

In this regard, we explore the problem of constrained bidding in adversarial bidding environments, which assumes no knowledge about the adversarial factors.

Meta-Learning reinforcement-learning

COPR: Consistency-Oriented Pre-Ranking for Online Advertising

no code implementations6 Jun 2023 Zhishan Zhao, Jingyue Gao, Yu Zhang, Shuguang Han, Siyuan Lou, Xiang-Rong Sheng, Zhe Wang, Han Zhu, Yuning Jiang, Jian Xu, Bo Zheng

In this architecture, the pre-ranking model is expected to be a lightweight approximation of the ranking model, which handles more candidates with strict latency requirements.

On Structural Expressive Power of Graph Transformers

no code implementations23 May 2023 Wenhao Zhu, Tianyu Wen, Guojie Song, Liang Wang, Bo Zheng

Graph Transformer has recently received wide attention in the research community with its outstanding performance, yet its structural expressive power has not been well analyzed.

Graph Neural Network

FedAds: A Benchmark for Privacy-Preserving CVR Estimation with Vertical Federated Learning

no code implementations15 May 2023 Penghui Wei, Hongjian Dou, Shaoguo Liu, Rongjun Tang, Li Liu, Liang Wang, Bo Zheng

We introduce FedAds, the first benchmark for CVR estimation with vFL, to facilitate standardized and systematical evaluations for vFL algorithms.

Privacy Preserving Vertical Federated Learning

Hybrid Contrastive Constraints for Multi-Scenario Ad Ranking

no code implementations6 Feb 2023 Shanlei Mu, Penghui Wei, Wayne Xin Zhao, Shaoguo Liu, Liang Wang, Bo Zheng

In this paper, we propose a Hybrid Contrastive Constrained approach (HC^2) for multi-scenario ad ranking.

Contrastive Learning

RLTP: Reinforcement Learning to Pace for Delayed Impression Modeling in Preloaded Ads

no code implementations6 Feb 2023 Penghui Wei, Yongqiang Chen, Shaoguo Liu, Liang Wang, Bo Zheng

In a whole delivery period, advertisers usually desire a certain impression count for the ads, and they also expect that the delivery performance is as good as possible (e. g., obtaining high click-through rate).

reinforcement-learning Reinforcement Learning (RL)

End-to-End Modeling Hierarchical Time Series Using Autoregressive Transformer and Conditional Normalizing Flow based Reconciliation

1 code implementation28 Dec 2022 Shiyu Wang, Fan Zhou, Yinbo Sun, Lintao Ma, James Zhang, Yangfei Zheng, Bo Zheng, Lei Lei, Yun Hu

Multivariate time series forecasting with hierarchical structure is pervasive in real-world applications, demanding not only predicting each level of the hierarchy, but also reconciling all forecasts to ensure coherency, i. e., the forecasts should satisfy the hierarchical aggregation constraints.

Multivariate Time Series Forecasting Time Series

Correlative Preference Transfer with Hierarchical Hypergraph Network for Multi-Domain Recommendation

no code implementations21 Nov 2022 Zixuan Xu, Penghui Wei, Shaoguo Liu, Weimin Zhang, Liang Wang, Bo Zheng

Conventional graph neural network based methods usually deal with each domain separately, or train a shared model to serve all domains.

Graph Neural Network Marketing

Sustainable Online Reinforcement Learning for Auto-bidding

1 code implementation13 Oct 2022 Zhiyu Mou, Yusen Huo, Rongquan Bai, Mingzhou Xie, Chuan Yu, Jian Xu, Bo Zheng

Due to safety concerns, it was believed that the RL training process can only be carried out in an offline virtual advertising system (VAS) that is built based on the historical data generated in the RAS.

Q-Learning reinforcement-learning +2

Towards Understanding the Overfitting Phenomenon of Deep Click-Through Rate Prediction Models

1 code implementation4 Sep 2022 Zhao-Yu Zhang, Xiang-Rong Sheng, Yujing Zhang, Biye Jiang, Shuguang Han, Hongbo Deng, Bo Zheng

However, far less attention has been paid to the overfitting problem of models in recommendation systems, which, on the contrary, is recognized as a critical issue for deep neural networks.

Click-Through Rate Prediction Recommendation Systems

Modeling Adaptive Fine-grained Task Relatedness for Joint CTR-CVR Estimation

no code implementations29 Aug 2022 Zihan Lin, Xuanhua Yang, Xiaoyu Peng, Wayne Xin Zhao, Shaoguo Liu, Liang Wang, Bo Zheng

For this purpose, we build a relatedness prediction network, so that it can predict the contrast strength for inter-task representations of an instance.

Contrastive Learning Multi-Task Learning +2

KEEP: An Industrial Pre-Training Framework for Online Recommendation via Knowledge Extraction and Plugging

no code implementations22 Aug 2022 Yujing Zhang, Zhangming Chan, Shuhao Xu, Weijie Bian, Shuguang Han, Hongbo Deng, Bo Zheng

To alleviate this issue, we propose to extract knowledge from the \textit{super-domain} that contains web-scale and long-time impression data, and further assist the online recommendation task (downstream task).

Recommendation Systems

Joint Optimization of Ranking and Calibration with Contextualized Hybrid Model

1 code implementation12 Aug 2022 Xiang-Rong Sheng, Jingyue Gao, Yueyao Cheng, Siran Yang, Shuguang Han, Hongbo Deng, Yuning Jiang, Jian Xu, Bo Zheng

It can be attributed to the calibration ability of the pointwise loss since the prediction can be viewed as the click probability.

Click-Through Rate Prediction

Adaptive Domain Interest Network for Multi-domain Recommendation

no code implementations20 Jun 2022 Yuchen Jiang, Qi Li, Han Zhu, Jinbei Yu, Jin Li, Ziru Xu, Huihui Dong, Bo Zheng

Industrial recommender systems usually hold data from multiple business scenarios and are expected to provide recommendation services for these scenarios simultaneously.

Domain Adaptation Recommendation Systems +1

Learning Disentangled Representations for Counterfactual Regression via Mutual Information Minimization

no code implementations2 Jun 2022 Mingyuan Cheng, Xinru Liao, Quan Liu, Bin Ma, Jian Xu, Bo Zheng

Learning individual-level treatment effect is a fundamental problem in causal inference and has received increasing attention in many areas, especially in the user growth area which concerns many internet companies.

Causal Inference counterfactual +3

Hierarchically Constrained Adaptive Ad Exposure in Feeds

no code implementations31 May 2022 Dagui Chen, Qi Yan, Chunjie Chen, Zhenzhe Zheng, Yangsu Liu, Zhenjia Ma, Chuan Yu, Jian Xu, Bo Zheng

To this end, adaptive ad exposure has become an appealing strategy to boost the overall performance of the feed.

Computational Efficiency

Towards Personalized Bundle Creative Generation with Contrastive Non-Autoregressive Decoding

no code implementations30 May 2022 Penghui Wei, Shaoguo Liu, Xuanhua Yang, Liang Wang, Bo Zheng

Current bundle generation studies focus on generating a combination of items to improve user experience.

GBA: A Tuning-free Approach to Switch between Synchronous and Asynchronous Training for Recommendation Model

no code implementations23 May 2022 Wenbo Su, Yuanxing Zhang, Yufeng Cai, Kaixu Ren, Pengjie Wang, Huimin Yi, Yue Song, Jing Chen, Hongbo Deng, Jian Xu, Lin Qu, Bo Zheng

High-concurrency asynchronous training upon parameter server (PS) architecture and high-performance synchronous training upon all-reduce (AR) architecture are the most commonly deployed distributed training modes for recommendation models.

Recommendation Systems

CREATER: CTR-driven Advertising Text Generation with Controlled Pre-Training and Contrastive Fine-Tuning

no code implementations NAACL (ACL) 2022 Penghui Wei, Xuanhua Yang, Shaoguo Liu, Liang Wang, Bo Zheng

This paper focuses on automatically generating the text of an ad, and the goal is that the generated text can capture user interest for achieving higher click-through rate (CTR).

Contrastive Learning Text Generation

StableMoE: Stable Routing Strategy for Mixture of Experts

1 code implementation ACL 2022 Damai Dai, Li Dong, Shuming Ma, Bo Zheng, Zhifang Sui, Baobao Chang, Furu Wei

We point out that existing learning-to-route MoE methods suffer from the routing fluctuation issue, i. e., the target expert of the same input may change along with training, but only one expert will be activated for the input during inference.

Language Modeling Language Modelling +1

PICASSO: Unleashing the Potential of GPU-centric Training for Wide-and-deep Recommender Systems

1 code implementation11 Apr 2022 Yuanxing Zhang, Langshi Chen, Siran Yang, Man Yuan, Huimin Yi, Jie Zhang, Jiamang Wang, Jianbo Dong, Yunlong Xu, Yue Song, Yong Li, Di Zhang, Wei Lin, Lin Qu, Bo Zheng

However, we observe that GPU devices in training recommender systems are underutilized, and they cannot attain an expected throughput improvement as what it has achieved in CV and NLP areas.

Marketing Recommendation Systems

APG: Adaptive Parameter Generation Network for Click-Through Rate Prediction

1 code implementation30 Mar 2022 Bencheng Yan, Pengjie Wang, Kai Zhang, Feng Li, Hongbo Deng, Jian Xu, Bo Zheng

In many web applications, deep learning-based CTR prediction models (deep CTR models for short) are widely adopted.

Click-Through Rate Prediction

AMCAD: Adaptive Mixed-Curvature Representation based Advertisement Retrieval System

no code implementations28 Mar 2022 Zhirong Xu, Shiyang Wen, Junshan Wang, Guojun Liu, Liang Wang, Zhi Yang, Lei Ding, Yan Zhang, Di Zhang, Jian Xu, Bo Zheng

Moreover, to deploy AMCAD in Taobao, one of the largest ecommerce platforms with hundreds of million users, we design an efficient two-layer online retrieval framework for the task of graph based advertisement retrieval.

Graph Embedding Information Retrieval +1

BEAT: A Large-Scale Semantic and Emotional Multi-Modal Dataset for Conversational Gestures Synthesis

1 code implementation10 Mar 2022 Haiyang Liu, Zihao Zhu, Naoya Iwamoto, Yichen Peng, Zhengqing Li, You Zhou, Elif Bozkurt, Bo Zheng

Achieving realistic, vivid, and human-like synthesized conversational gestures conditioned on multi-modal data is still an unsolved problem due to the lack of available datasets, models and standard evaluation metrics.

Gesture Generation Gesture Recognition

Asymptotically Unbiased Estimation for Delayed Feedback Modeling via Label Correction

1 code implementation14 Feb 2022 Yu Chen, Jiaqi Jin, Hui Zhao, Pengjie Wang, Guojun Liu, Jian Xu, Bo Zheng

Moreover, to estimate CVR upon the freshly observed but biased distribution with fake negatives, the importance sampling is widely used to reduce the distribution bias.

MBCT: Tree-Based Feature-Aware Binning for Individual Uncertainty Calibration

1 code implementation9 Feb 2022 Siguang Huang, Yunli Wang, Lili Mou, Huayue Zhang, Han Zhu, Chuan Yu, Bo Zheng

In previous work, researchers have developed several calibration methods to post-process the outputs of a predictor to obtain calibrated values, such as binning and scaling methods.

Medical Diagnosis

UKD: Debiasing Conversion Rate Estimation via Uncertainty-regularized Knowledge Distillation

no code implementations20 Jan 2022 Zixuan Xu, Penghui Wei, Weimin Zhang, Shaoguo Liu, Liang Wang, Bo Zheng

Then a student model is trained on both clicked and unclicked ads with knowledge distillation, performing uncertainty modeling to alleviate the inherent noise in pseudo-labels.

Knowledge Distillation Selection bias

Adversarial Gradient Driven Exploration for Deep Click-Through Rate Prediction

no code implementations21 Dec 2021 Kailun Wu, Zhangming Chan, Weijie Bian, Lejian Ren, Shiming Xiang, Shuguang Han, Hongbo Deng, Bo Zheng

We further show that such a process is equivalent to adding an adversarial perturbation to the model input, and thereby name our proposed approach as an the Adversarial Gradient Driven Exploration (AGE).

Click-Through Rate Prediction Recommendation Systems

Characterizing the adversarial vulnerability of speech self-supervised learning

no code implementations8 Nov 2021 Haibin Wu, Bo Zheng, Xu Li, Xixin Wu, Hung-Yi Lee, Helen Meng

As the paradigm of the self-supervised learning upstream model followed by downstream tasks arouses more attention in the speech community, characterizing the adversarial robustness of such paradigm is of high priority.

Adversarial Robustness Benchmarking +3

Binary Code based Hash Embedding for Web-scale Applications

no code implementations24 Aug 2021 Bencheng Yan, Pengjie Wang, Jinquan Liu, Wei Lin, Kuang-Chih Lee, Jian Xu, Bo Zheng

In these applications, embedding learning of categorical features is crucial to the success of deep learning models.

Recommendation Systems

Real-world Video Deblurring: A Benchmark Dataset and An Efficient Recurrent Neural Network

1 code implementation ECCV 2020 Zhihang Zhong, Ye Gao, Yinqiang Zheng, Bo Zheng, Imari Sato

Real-world video deblurring in real time still remains a challenging task due to the complexity of spatially and temporally varying blur itself and the requirement of low computational cost.

Deblurring Image Deblurring +1

Multi-Agent Cooperative Bidding Games for Multi-Objective Optimization in e-Commercial Sponsored Search

no code implementations8 Jun 2021 Ziyu Guan, Hongchang Wu, Qingyu Cao, Hao liu, Wei Zhao, Sheng Li, Cai Xu, Guang Qiu, Jian Xu, Bo Zheng

Although a few studies use multi-agent reinforcement learning to set up a cooperative game, they still suffer the following drawbacks: (1) They fail to avoid collusion solutions where all the advertisers involved in an auction collude to bid an extremely low price on purpose.

Model Optimization Multi-agent Reinforcement Learning

4DComplete: Non-Rigid Motion Estimation Beyond the Observable Surface

1 code implementation ICCV 2021 Yang Li, Hikari Takehara, Takafumi Taketomi, Bo Zheng, Matthias Nießner

Tracking non-rigidly deforming scenes using range sensors has numerous applications including computer vision, AR/VR, and robotics.

Motion Estimation

Tracking Air Pollution in China: Near Real-Time PM2.5 Retrievals from Multiple Data Sources

no code implementations11 Mar 2021 Guannan Geng, Qingyang Xiao, Shigan Liu, Xiaodong Liu, Jing Cheng, Yixuan Zheng, Dan Tong, Bo Zheng, Yiran Peng, Xiaomeng Huang, Kebin He, Qiang Zhang

Accordingly, a full-coverage high-resolution air pollutant dataset with timely updates and historical long-term records is essential to support both research and environmental management.

Management

Global Daily CO$_2$ emissions for the year 2020

no code implementations3 Mar 2021 Zhu Liu, Zhu Deng, Philippe Ciais, Jianguang Tan, Biqing Zhu, Steven J. Davis, Robbie Andrew, Olivier Boucher, Simon Ben Arous, Pep Canadel, Xinyu Dou, Pierre Friedlingstein, Pierre Gentine, Rui Guo, Chaopeng Hong, Robert B. Jackson, Daniel M. Kammen, Piyu Ke, Corinne Le Quere, Crippa Monica, Greet Janssens-Maenhout, Glen Peters, Katsumasa Tanaka, Yilong Wang, Bo Zheng, Haiwang Zhong, Taochun Sun, Hans Joachim Schellnhuber

That even substantial world-wide lockdowns of activity led to a one-time decline in global CO$_2$ emissions of only 5. 4% in one year highlights the significant challenges for climate change mitigation that we face in the post-COVID era.

Atmospheric and Oceanic Physics General Economics Economics

Document Modeling with Graph Attention Networks for Multi-grained Machine Reading Comprehension

1 code implementation ACL 2020 Bo Zheng, Haoyang Wen, Yaobo Liang, Nan Duan, Wanxiang Che, Daxin Jiang, Ming Zhou, Ting Liu

Natural Questions is a new challenging machine reading comprehension benchmark with two-grained answers, which are a long answer (typically a paragraph) and a short answer (one or more entities inside the long answer).

Graph Attention Machine Reading Comprehension +1

Adversarial Multimodal Representation Learning for Click-Through Rate Prediction

no code implementations7 Mar 2020 Xiang Li, Chao Wang, Jiwei Tan, Xiaoyi Zeng, Dan Ou, Bo Zheng

Finally, we achieve the multimodal item representations by combining both modality-specific and modality-invariant representations.

Click-Through Rate Prediction Prediction +1

RPM-Oriented Query Rewriting Framework for E-commerce Keyword-Based Sponsored Search

no code implementations28 Oct 2019 Xiuying Chen, Daorui Xiao, Shen Gao, Guojun Liu, Wei. Lin, Bo Zheng, Dongyan Zhao, Rui Yan

Sponsored search optimizes revenue and relevance, which is estimated by Revenue Per Mille (RPM).

A bi-diffusion based layer-wise sampling method for deep learning in large graphs

no code implementations25 Sep 2019 Yu He, Shiyang Wen, Wenjin Wu, Yan Zhang, Siran Yang, Yuan Wei, Di Zhang, Guojie Song, Wei Lin, Liang Wang, Bo Zheng

The Graph Convolutional Network (GCN) and its variants are powerful models for graph representation learning and have recently achieved great success on many graph-based applications.

Graph Representation Learning

A Minimax Game for Instance based Selective Transfer Learning

no code implementations1 Jul 2019 Bo wang, Minghui Qiu, Xisen Wang, Yaliang Li, Yu Gong, Xiaoyi Zeng, Jung Huang, Bo Zheng, Deng Cai, Jingren Zhou

To the best of our knowledge, this is the first to build a minimax game based model for selective transfer learning.

Text Retrieval Transfer Learning

An AMR Aligner Tuned by Transition-based Parser

1 code implementation EMNLP 2018 Yijia Liu, Wanxiang Che, Bo Zheng, Bing Qin, Ting Liu

In this paper, we propose a new rich resource enhanced AMR aligner which produces multiple alignments and a new transition system for AMR parsing along with its oracle parser.

AMR Parsing POS +1

The HIT-SCIR System for End-to-End Parsing of Universal Dependencies

no code implementations CONLL 2017 Wanxiang Che, Jiang Guo, Yuxuan Wang, Bo Zheng, Huaipeng Zhao, Yang Liu, Dechuan Teng, Ting Liu

Our system includes three pipelined components: \textit{tokenization}, \textit{Part-of-Speech} (POS) \textit{tagging} and \textit{dependency parsing}.

Dependency Parsing Information Retrieval +4

Robust 3D Features for Matching between Distorted Range Scans Captured by Moving Systems

no code implementations CVPR 2014 Xiangqi Huang, Bo Zheng, Takeshi Masuda, Katsushi Ikeuchi

Our feature description is designed as two steps: 1) we normalize the detected local regions to canonical shapes for robust matching; 2) we encode each key point with multiple vectors at different Morse function values.

3D Reconstruction

Cannot find the paper you are looking for? You can Submit a new open access paper.