Search Results for author: Ning Zhang

Found 128 papers, 29 papers with code

Rethinking the Defocus Blur Detection Problem and A Real-Time Deep DBD Model

no code implementations ECCV 2020 Ning Zhang, Junchi Yan

In this work, we propose novel perspectives on the DBD problem and design convenient approach to build a real-time cost-effective DBD model.

Data Augmentation Defocus Blur Detection

Accelerating Multimodal Large Language Models by Searching Optimal Vision Token Reduction

no code implementations30 Nov 2024 Shiyu Zhao, Zhenting Wang, Felix Juefei-Xu, Xide Xia, Miao Liu, Xiaofang Wang, Mingfu Liang, Ning Zhang, Dimitris N. Metaxas, Licheng Yu

For Scenario II, based on the reduction strategy from G-Search, we design a parametric sigmoid function (P-Sigmoid) to guide the reduction at each layer of the MLLM, whose parameters are optimized by Bayesian Optimization.

Sequential LLM Framework for Fashion Recommendation

no code implementations15 Oct 2024 Han Liu, Xianfeng Tang, Tianlang Chen, Jiapeng Liu, Indu Indu, Henry Peng Zou, Peng Dai, Roberto Fernandez Galan, Michael D Porter, Dongmei Jia, Ning Zhang, Lian Xiong

The fashion industry is one of the leading domains in the global e-commerce sector, prompting major online retailers to employ recommendation systems for product suggestions and customer convenience.

Language Modelling Large Language Model +3

Toward Scalable Image Feature Compression: A Content-Adaptive and Diffusion-Based Approach

no code implementations8 Oct 2024 Sha Guo, Zhuo Chen, Yang Zhao, Ning Zhang, Xiaotong Li, Lingyu Duan

Extensive experiments demonstrate the effectiveness of the proposed framework in both image reconstruction and downstream machine vision tasks such as object detection, segmentation, and facial landmark detection, achieving superior perceptual quality compared to state-of-the-art methods.

Data Compression Facial Landmark Detection +5

ZALM3: Zero-Shot Enhancement of Vision-Language Alignment via In-Context Information in Multi-Turn Multimodal Medical Dialogue

no code implementations26 Sep 2024 Zhangpu Li, Changhong Zou, Suxue Ma, Zhicheng Yang, Chen Du, YouBao Tang, Zhenjie Cao, Ning Zhang, Jui-Hsin Lai, Ruei-Sung Lin, Yuan Ni, Xingzhi Sun, Jing Xiao, Jieke Hou, Kai Zhang, Mei Han

In our online medical consultation scenario, a doctor responds to the texts and images provided by a patient in multiple rounds to diagnose her/his health condition, forming a multi-turn multimodal medical dialogue format.

Medical Visual Question Answering Question Answering +2

Model-in-the-Loop (MILO): Accelerating Multimodal AI Data Annotation with LLMs

no code implementations16 Sep 2024 Yifan Wang, David Stevens, Pranay Shah, WenWen Jiang, Miao Liu, Xu Chen, Robert Kuo, Na Li, Boying Gong, Daniel Lee, Jiabo Hu, Ning Zhang, Bob Kamma

The growing demand for AI training data has transformed data annotation into a global industry, but traditional approaches relying on human annotators are often time-consuming, labor-intensive, and prone to inconsistent quality.

MGSA: Multi-Granularity Graph Structure Attention for Knowledge Graph-to-Text Generation

no code implementations16 Sep 2024 Shanshan Wang, Chun Zhang, Ning Zhang

The Knowledge Graph-to-Text Generation task aims to convert structured knowledge graphs into coherent and human-readable natural language text.

KG-to-Text Generation Knowledge Graphs +1

SoK: Security and Privacy Risks of Medical AI

no code implementations11 Sep 2024 Yuanhaur Chang, Han Liu, Evin Jaff, Chenyang Lu, Ning Zhang

The integration of technology and healthcare has ushered in a new era where software systems, powered by artificial intelligence and machine learning, have become essential components of medical products and services.

Joint Model Assignment and Resource Allocation for Cost-Effective Mobile Generative Services

no code implementations9 Sep 2024 Shuangwei Gao, Peng Yang, Yuxin Kong, Feng Lyu, Ning Zhang

Artificial Intelligence Generated Content (AIGC) services can efficiently satisfy user-specified content creation demands, but the high computational requirements pose various challenges to supporting mobile users at scale.

Self-Supervised Multi-Scale Network for Blind Image Deblurring via Alternating Optimization

no code implementations2 Sep 2024 Lening Guo, Jing Yu, Ning Zhang, Chuangbai Xiao

In the image estimation step, we construct a multi-scale generator network with multiple inputs and multiple outputs to collaboratively estimate latent images at various scales, supervised by an image pyramid constructed from only the blurred image.

Blind Image Deblurring Image Deblurring

Data Exposure from LLM Apps: An In-depth Investigation of OpenAI's GPTs

no code implementations23 Aug 2024 Evin Jaff, Yuhao Wu, Ning Zhang, Umar Iqbal

Our measurements indicate that the disclosures for most of the collected data types are omitted in privacy policies, with only 5. 8% of Actions clearly disclosing their data collection practices.

Class-aware and Augmentation-free Contrastive Learning from Label Proportion

no code implementations13 Aug 2024 Jialiang Wang, Ning Zhang, Shimin Di, Ruidong Wang, Lei Chen

Learning from Label Proportion (LLP) is a weakly supervised learning scenario in which training data is organized into predefined bags of instances, disclosing only the class label proportions per bag.

Contrastive Learning Representation Learning +1

The Llama 3 Herd of Models

1 code implementation31 Jul 2024 Aaron Grattafiori, Abhimanyu Dubey, Abhinav Jauhri, Abhinav Pandey, Abhishek Kadian, Ahmad Al-Dahle, Aiesha Letman, Akhil Mathur, Alan Schelten, Alex Vaughan, Amy Yang, Angela Fan, Anirudh Goyal, Anthony Hartshorn, Aobo Yang, Archi Mitra, Archie Sravankumar, Artem Korenev, Arthur Hinsvark, Arun Rao, Aston Zhang, Aurelien Rodriguez, Austen Gregerson, Ava Spataru, Baptiste Roziere, Bethany Biron, Binh Tang, Bobbie Chern, Charlotte Caucheteux, Chaya Nayak, Chloe Bi, Chris Marra, Chris McConnell, Christian Keller, Christophe Touret, Chunyang Wu, Corinne Wong, Cristian Canton Ferrer, Cyrus Nikolaidis, Damien Allonsius, Daniel Song, Danielle Pintz, Danny Livshits, Danny Wyatt, David Esiobu, Dhruv Choudhary, Dhruv Mahajan, Diego Garcia-Olano, Diego Perino, Dieuwke Hupkes, Egor Lakomkin, Ehab AlBadawy, Elina Lobanova, Emily Dinan, Eric Michael Smith, Filip Radenovic, Francisco Guzmán, Frank Zhang, Gabriel Synnaeve, Gabrielle Lee, Georgia Lewis Anderson, Govind Thattai, Graeme Nail, Gregoire Mialon, Guan Pang, Guillem Cucurell, Hailey Nguyen, Hannah Korevaar, Hu Xu, Hugo Touvron, Iliyan Zarov, Imanol Arrieta Ibarra, Isabel Kloumann, Ishan Misra, Ivan Evtimov, Jack Zhang, Jade Copet, Jaewon Lee, Jan Geffert, Jana Vranes, Jason Park, Jay Mahadeokar, Jeet Shah, Jelmer Van der Linde, Jennifer Billock, Jenny Hong, Jenya Lee, Jeremy Fu, Jianfeng Chi, Jianyu Huang, Jiawen Liu, Jie Wang, Jiecao Yu, Joanna Bitton, Joe Spisak, Jongsoo Park, Joseph Rocca, Joshua Johnstun, Joshua Saxe, Junteng Jia, Kalyan Vasuden Alwala, Karthik Prasad, Kartikeya Upasani, Kate Plawiak, Ke Li, Kenneth Heafield, Kevin Stone, Khalid El-Arini, Krithika Iyer, Kshitiz Malik, Kuenley Chiu, Kunal Bhalla, Kushal Lakhotia, Lauren Rantala-Yeary, Laurens van der Maaten, Lawrence Chen, Liang Tan, Liz Jenkins, Louis Martin, Lovish Madaan, Lubo Malo, Lukas Blecher, Lukas Landzaat, Luke de Oliveira, Madeline Muzzi, Mahesh Pasupuleti, Mannat Singh, Manohar Paluri, Marcin Kardas, Maria Tsimpoukelli, Mathew Oldham, Mathieu Rita, Maya Pavlova, Melanie Kambadur, Mike Lewis, Min Si, Mitesh Kumar Singh, Mona Hassan, Naman Goyal, Narjes Torabi, Nikolay Bashlykov, Nikolay Bogoychev, Niladri Chatterji, Ning Zhang, Olivier Duchenne, Onur Çelebi, Patrick Alrassy, Pengchuan Zhang, Pengwei Li, Petar Vasic, Peter Weng, Prajjwal Bhargava, Pratik Dubal, Praveen Krishnan, Punit Singh Koura, Puxin Xu, Qing He, Qingxiao Dong, Ragavan Srinivasan, Raj Ganapathy, Ramon Calderer, Ricardo Silveira Cabral, Robert Stojnic, Roberta Raileanu, Rohan Maheswari, Rohit Girdhar, Rohit Patel, Romain Sauvestre, Ronnie Polidoro, Roshan Sumbaly, Ross Taylor, Ruan Silva, Rui Hou, Rui Wang, Saghar Hosseini, Sahana Chennabasappa, Sanjay Singh, Sean Bell, Seohyun Sonia Kim, Sergey Edunov, Shaoliang Nie, Sharan Narang, Sharath Raparthy, Sheng Shen, Shengye Wan, Shruti Bhosale, Shun Zhang, Simon Vandenhende, Soumya Batra, Spencer Whitman, Sten Sootla, Stephane Collot, Suchin Gururangan, Sydney Borodinsky, Tamar Herman, Tara Fowler, Tarek Sheasha, Thomas Georgiou, Thomas Scialom, Tobias Speckbacher, Todor Mihaylov, Tong Xiao, Ujjwal Karn, Vedanuj Goswami, Vibhor Gupta, Vignesh Ramanathan, Viktor Kerkez, Vincent Gonguet, Virginie Do, Vish Vogeti, Vítor Albiero, Vladan Petrovic, Weiwei Chu, Wenhan Xiong, Wenyin Fu, Whitney Meers, Xavier Martinet, Xiaodong Wang, Xiaofang Wang, Xiaoqing Ellen Tan, Xide Xia, Xinfeng Xie, Xuchao Jia, Xuewei Wang, Yaelle Goldschlag, Yashesh Gaur, Yasmine Babaei, Yi Wen, Yiwen Song, Yuchen Zhang, Yue Li, Yuning Mao, Zacharie Delpierre Coudert, Zheng Yan, Zhengxing Chen, Zoe Papakipos, Aaditya Singh, Aayushi Srivastava, Abha Jain, Adam Kelsey, Adam Shajnfeld, Adithya Gangidi, Adolfo Victoria, Ahuva Goldstand, Ajay Menon, Ajay Sharma, Alex Boesenberg, Alexei Baevski, Allie Feinstein, Amanda Kallet, Amit Sangani, Amos Teo, Anam Yunus, Andrei Lupu, Andres Alvarado, Andrew Caples, Andrew Gu, Andrew Ho, Andrew Poulton, Andrew Ryan, Ankit Ramchandani, Annie Dong, Annie Franco, Anuj Goyal, Aparajita Saraf, Arkabandhu Chowdhury, Ashley Gabriel, Ashwin Bharambe, Assaf Eisenman, Azadeh Yazdan, Beau James, Ben Maurer, Benjamin Leonhardi, Bernie Huang, Beth Loyd, Beto De Paola, Bhargavi Paranjape, Bing Liu, Bo Wu, Boyu Ni, Braden Hancock, Bram Wasti, Brandon Spence, Brani Stojkovic, Brian Gamido, Britt Montalvo, Carl Parker, Carly Burton, Catalina Mejia, Ce Liu, Changhan Wang, Changkyu Kim, Chao Zhou, Chester Hu, Ching-Hsiang Chu, Chris Cai, Chris Tindal, Christoph Feichtenhofer, Cynthia Gao, Damon Civin, Dana Beaty, Daniel Kreymer, Daniel Li, David Adkins, David Xu, Davide Testuggine, Delia David, Devi Parikh, Diana Liskovich, Didem Foss, Dingkang Wang, Duc Le, Dustin Holland, Edward Dowling, Eissa Jamil, Elaine Montgomery, Eleonora Presani, Emily Hahn, Emily Wood, Eric-Tuan Le, Erik Brinkman, Esteban Arcaute, Evan Dunbar, Evan Smothers, Fei Sun, Felix Kreuk, Feng Tian, Filippos Kokkinos, Firat Ozgenel, Francesco Caggioni, Frank Kanayet, Frank Seide, Gabriela Medina Florez, Gabriella Schwarz, Gada Badeer, Georgia Swee, Gil Halpern, Grant Herman, Grigory Sizov, Guangyi, Zhang, Guna Lakshminarayanan, Hakan Inan, Hamid Shojanazeri, Han Zou, Hannah Wang, Hanwen Zha, Haroun Habeeb, Harrison Rudolph, Helen Suk, Henry Aspegren, Hunter Goldman, Hongyuan Zhan, Ibrahim Damlaj, Igor Molybog, Igor Tufanov, Ilias Leontiadis, Irina-Elena Veliche, Itai Gat, Jake Weissman, James Geboski, James Kohli, Janice Lam, Japhet Asher, Jean-Baptiste Gaya, Jeff Marcus, Jeff Tang, Jennifer Chan, Jenny Zhen, Jeremy Reizenstein, Jeremy Teboul, Jessica Zhong, Jian Jin, Jingyi Yang, Joe Cummings, Jon Carvill, Jon Shepard, Jonathan McPhie, Jonathan Torres, Josh Ginsburg, Junjie Wang, Kai Wu, Kam Hou U, Karan Saxena, Kartikay Khandelwal, Katayoun Zand, Kathy Matosich, Kaushik Veeraraghavan, Kelly Michelena, Keqian Li, Kiran Jagadeesh, Kun Huang, Kunal Chawla, Kyle Huang, Lailin Chen, Lakshya Garg, Lavender A, Leandro Silva, Lee Bell, Lei Zhang, Liangpeng Guo, Licheng Yu, Liron Moshkovich, Luca Wehrstedt, Madian Khabsa, Manav Avalani, Manish Bhatt, Martynas Mankus, Matan Hasson, Matthew Lennie, Matthias Reso, Maxim Groshev, Maxim Naumov, Maya Lathi, Meghan Keneally, Miao Liu, Michael L. Seltzer, Michal Valko, Michelle Restrepo, Mihir Patel, Mik Vyatskov, Mikayel Samvelyan, Mike Clark, Mike Macey, Mike Wang, Miquel Jubert Hermoso, Mo Metanat, Mohammad Rastegari, Munish Bansal, Nandhini Santhanam, Natascha Parks, Natasha White, Navyata Bawa, Nayan Singhal, Nick Egebo, Nicolas Usunier, Nikhil Mehta, Nikolay Pavlovich Laptev, Ning Dong, Norman Cheng, Oleg Chernoguz, Olivia Hart, Omkar Salpekar, Ozlem Kalinli, Parkin Kent, Parth Parekh, Paul Saab, Pavan Balaji, Pedro Rittner, Philip Bontrager, Pierre Roux, Piotr Dollar, Polina Zvyagina, Prashant Ratanchandani, Pritish Yuvraj, Qian Liang, Rachad Alao, Rachel Rodriguez, Rafi Ayub, Raghotham Murthy, Raghu Nayani, Rahul Mitra, Rangaprabhu Parthasarathy, Raymond Li, Rebekkah Hogan, Robin Battey, Rocky Wang, Russ Howes, Ruty Rinott, Sachin Mehta, Sachin Siby, Sai Jayesh Bondu, Samyak Datta, Sara Chugh, Sara Hunt, Sargun Dhillon, Sasha Sidorov, Satadru Pan, Saurabh Mahajan, Saurabh Verma, Seiji Yamamoto, Sharadh Ramaswamy, Shaun Lindsay, Sheng Feng, Shenghao Lin, Shengxin Cindy Zha, Shishir Patil, Shiva Shankar, Shuqiang Zhang, Sinong Wang, Sneha Agarwal, Soji Sajuyigbe, Soumith Chintala, Stephanie Max, Stephen Chen, Steve Kehoe, Steve Satterfield, Sudarshan Govindaprasad, Sumit Gupta, Summer Deng, Sungmin Cho, Sunny Virk, Suraj Subramanian, Sy Choudhury, Sydney Goldman, Tal Remez, Tamar Glaser, Tamara Best, Thilo Koehler, Thomas Robinson, Tianhe Li, Tianjun Zhang, Tim Matthews, Timothy Chou, Tzook Shaked, Varun Vontimitta, Victoria Ajayi, Victoria Montanez, Vijai Mohan, Vinay Satish Kumar, Vishal Mangla, Vlad Ionescu, Vlad Poenaru, Vlad Tiberiu Mihailescu, Vladimir Ivanov, Wei Li, Wenchen Wang, WenWen Jiang, Wes Bouaziz, Will Constable, Xiaocheng Tang, Xiaojian Wu, Xiaolan Wang, Xilun Wu, Xinbo Gao, Yaniv Kleinman, Yanjun Chen, Ye Hu, Ye Jia, Ye Qi, Yenda Li, Yilin Zhang, Ying Zhang, Yossi Adi, Youngjin Nam, Yu, Wang, Yu Zhao, Yuchen Hao, Yundi Qian, Yunlu Li, Yuzi He, Zach Rait, Zachary DeVito, Zef Rosnbrick, Zhaoduo Wen, Zhenyu Yang, Zhiwei Zhao, Zhiyu Ma

This paper presents a new set of foundation models, called Llama 3.

Language Modelling Multi-task Language Understanding +2

EUDA: An Efficient Unsupervised Domain Adaptation via Self-Supervised Vision Transformer

1 code implementation31 Jul 2024 Ali Abedi, Q. M. Jonathan Wu, Ning Zhang, Farhad Pourpanah

Unsupervised domain adaptation (UDA) aims to mitigate the domain shift issue, where the distribution of training (source) data differs from that of testing (target) data.

Unsupervised Domain Adaptation

Enhancing Question Answering on Charts Through Effective Pre-training Tasks

no code implementations14 Jun 2024 Ashim Gupta, Vivek Gupta, Shuo Zhang, Yujie He, Ning Zhang, Shalin Shah

To address these issues, we propose three simple pre-training tasks that enforce the existing model in terms of both structural-visual knowledge, as well as its understanding of numerical questions.

document understanding Optical Character Recognition (OCR) +1

Data-driven Power Flow Linearization: Simulation

no code implementations10 Jun 2024 Mengshuo Jia, Gabriela Hug, Ning Zhang, Zhaojian Wang, Yi Wang, Chongqing Kang

Subsequently, this paper evaluates a total of 44 methods, containing over 30 existing DPFL approaches, some innovative DPFL techniques, and several classic physics-driven power flow linearization methods for benchmarking.

Benchmarking Computational Efficiency

Data-driven Power Flow Linearization: Theory

no code implementations10 Jun 2024 Mengshuo Jia, Gabriela Hug, Ning Zhang, Zhaojian Wang, Yi Wang, Chongqing Kang

Further, this tutorial implements extensive numerical comparisons of all existing DPFL methods (40 methods in total) and four classic physics-driven approaches, focusing on their generalizability, applicability, accuracy, and computational efficiency.

Computational Efficiency

Active ML for 6G: Towards Efficient Data Generation, Acquisition, and Annotation

1 code implementation5 Jun 2024 Omar Alhussein, Ning Zhang, Sami Muhaidat, Weihua Zhuang

While active learning research mainly focuses on data annotation, we call for a network-centric active learning framework that considers both annotation (i. e., what is the label) and data acquisition (i. e., which and how many samples to collect).

Active Learning Computational Efficiency

STHN: Deep Homography Estimation for UAV Thermal Geo-localization with Satellite Imagery

1 code implementation30 May 2024 Jiuhong Xiao, Ning Zhang, Daniel Tortei, Giuseppe Loianno

However, the efficiency and effectiveness of current TG approaches, are hindered by dense sampling on satellite maps and geometric noises in thermal query images.

Autonomous Navigation geo-localization +1

Maximum Likelihood Estimation on Stochastic Blockmodels for Directed Graph Clustering

1 code implementation28 Mar 2024 Mihai Cucuringu, Xiaowen Dong, Ning Zhang

This paper studies the directed graph clustering problem through the lens of statistics, where we formulate clustering as estimating underlying communities in the directed stochastic block model (DSBM).

Clustering Graph Clustering +1

Don't Listen To Me: Understanding and Exploring Jailbreak Prompts of Large Language Models

no code implementations26 Mar 2024 Zhiyuan Yu, Xiaogeng Liu, Shunning Liang, Zach Cameron, Chaowei Xiao, Ning Zhang

Building on the insights from the user study, we also developed a system using AI as the assistant to automate the process of jailbreak prompt generation.

SecGPT: An Execution Isolation Architecture for LLM-Based Systems

1 code implementation8 Mar 2024 Yuhao Wu, Franziska Roesner, Tadayoshi Kohno, Ning Zhang, Umar Iqbal

These LLM apps leverage the de facto natural language-based automated execution paradigm of LLMs: that is, apps and their interactions are defined in natural language, provided access to user data, and allowed to freely interact with each other and the system.

Automatic and Universal Prompt Injection Attacks against Large Language Models

1 code implementation7 Mar 2024 Xiaogeng Liu, Zhiyuan Yu, Yizhe Zhang, Ning Zhang, Chaowei Xiao

Large Language Models (LLMs) excel in processing and generating human language, powered by their ability to interpret and follow instructions.

A New Era in LLM Security: Exploring Security Concerns in Real-World LLM-based Systems

no code implementations28 Feb 2024 Fangzhou Wu, Ning Zhang, Somesh Jha, Patrick McDaniel, Chaowei Xiao

Large Language Model (LLM) systems are inherently compositional, with individual LLM serving as the core foundation with additional layers of objects such as plugins, sandbox, and so on.

Language Modelling Large Language Model

Bidirectional Autoregressive Diffusion Model for Dance Generation

1 code implementation6 Feb 2024 Canyu Zhang, YouBao Tang, Ning Zhang, Ruei-Sung Lin, Mei Han, Jing Xiao, Song Wang

To make the generated dance motion smoother, a local information decoder is built for local motion enhancement.

Motion Generation

Preference Poisoning Attacks on Reward Model Learning

no code implementations2 Feb 2024 Junlin Wu, Jiongxiao Wang, Chaowei Xiao, Chenguang Wang, Ning Zhang, Yevgeniy Vorobeychik

In addition, we observe that the simpler and more scalable rank-by-distance approaches are often competitive with, and on occasion significantly outperform, gradient-based methods.

Recommendation Systems

Deep Video Inverse Tone Mapping Based on Temporal Clues

1 code implementation CVPR 2024 Yuyao Ye, Ning Zhang, Yang Zhao, Hongbin Cao, Ronggang Wang

Although many deep image ITM methods can generate impressive results the field of video ITM is still to be explored.

Tone Mapping

Adaptive Digital Twin for UAV-Assisted Integrated Sensing, Communication, and Computation Networks

no code implementations26 Oct 2023 Bin Li, Wenshuai Liu, Wancheng Xie, Ning Zhang, Yan Zhang

In this paper, we study a digital twin (DT)-empowered integrated sensing, communication, and computation network.

Edge-computing

You Only Look at Once for Real-time and Generic Multi-Task

1 code implementation2 Oct 2023 Jiayuan Wang, Q. M. Jonathan Wu, Ning Zhang

In this study, we incorporate A-YOLOM, an adaptive, real-time, and lightweight multi-task model designed to concurrently address object detection, drivable area segmentation, and lane line segmentation tasks.

Autonomous Driving Drivable Area Detection +5

Event-Driven Imaging in Turbid Media: A Confluence of Optoelectronics and Neuromorphic Computation

no code implementations13 Sep 2023 Ning Zhang, Timothy Shea, Arto Nurmikko

In this paper a new optical-computational method is introduced to unveil images of targets whose visibility is severely obscured by light scattering in dense, turbid media.

Image Reconstruction

Robust Computation Offloading and Trajectory Optimization for Multi-UAV-Assisted MEC: A Multi-Agent DRL Approach

no code implementations24 Aug 2023 Bin Li, Rongrong Yang, Lei Liu, Junyi Wang, Ning Zhang, Mianxiong Dong

For multiple Unmanned-Aerial-Vehicles (UAVs) assisted Mobile Edge Computing (MEC) networks, we study the problem of combined computation and communication for user equipments deployed with multi-type tasks.

Deep Reinforcement Learning Edge-computing +1

Federated Learning Robust to Byzantine Attacks: Achieving Zero Optimality Gap

no code implementations21 Aug 2023 Shiyuan Zuo, Rongfei Fan, Han Hu, Ning Zhang, Shimin Gong

In this paper, we propose a robust aggregation method for federated learning (FL) that can effectively tackle malicious Byzantine attacks.

Federated Learning

Joint Power Control and Data Size Selection for Over-the-Air Computation Aided Federated Learning

1 code implementation17 Aug 2023 Xuming An, Rongfei Fan, Shiyuan Zuo, Han Hu, Hai Jiang, Ning Zhang

For parameter aggregating in FL, over-the-air computation is a spectrum-efficient solution, which allows all mobile devices to transmit their parameter-mapped signals concurrently to a BS.

Federated Learning

Symbol-level Integrated Sensing and Communication enabled Multiple Base Stations Cooperative Sensing

no code implementations13 Aug 2023 Zhiqing Wei, Ruizhong Xu, Zhiyong Feng, Huici Wu, Ning Zhang, Wangjun Jiang, Xiaoyu Yang

This work may provide a guideline for the design of multi-BS cooperative sensing system to exploit the widely deployed networked mobile communication system.

Line Spectrum Estimation and Detection with Few-bit ADCs: Theoretical Analysis and Generalized NOMP Algorithm

no code implementations2 Jul 2023 Jiang Zhu, Hansheng Zhang, Ning Zhang, Jun Fang, Fengzhong Qu

As radar systems will be equipped with thousands of antenna elements and wide bandwidth, the associated costs and power consumption become exceedingly high, and a potential solution is to adopt low-resolution quantization technology, which not only reduces data storage needs but also lowers power and hardware costs.

Quantization Super-Resolution

Fusing Structural and Functional Connectivities using Disentangled VAE for Detecting MCI

no code implementations16 Jun 2023 Qiankun Zuo, Yanfei Zhu, Libin Lu, Zhi Yang, Yuhui Li, Ning Zhang

In this paper, a novel hierarchical structural-functional connectivity fusing (HSCF) model is proposed to construct brain structural-functional connectivity matrices and predict abnormal brain connections based on functional magnetic resonance imaging (fMRI) and diffusion tensor imaging (DTI).

Functional Connectivity

Learning and Verification of Task Structure in Instructional Videos

no code implementations23 Mar 2023 Medhini Narasimhan, Licheng Yu, Sean Bell, Ning Zhang, Trevor Darrell

We introduce a new pre-trained video model, VideoTaskformer, focused on representing the semantics and structure of instructional videos.

Activity Recognition

Que2Engage: Embedding-based Retrieval for Relevant and Engaging Products at Facebook Marketplace

no code implementations21 Feb 2023 Yunzhong He, Yuxin Tian, Mengjiao Wang, Feier Chen, Licheng Yu, Maolong Tang, Congcong Chen, Ning Zhang, Bin Kuang, Arul Prakash

In this paper we presents Que2Engage, a search EBR system built towards bridging the gap between retrieval and ranking for end-to-end optimizations.

Retrieval

HSTFormer: Hierarchical Spatial-Temporal Transformers for 3D Human Pose Estimation

no code implementations18 Jan 2023 Xiaoye Qian, YouBao Tang, Ning Zhang, Mei Han, Jing Xiao, Ming-Chun Huang, Ruei-Sung Lin

To mitigate this issue, we propose Hierarchical Spatial-Temporal transFormers (HSTFormer) to capture multi-level joints' spatial-temporal correlations from local to global gradually for accurate 3D HPE.

3D Human Pose Estimation

Artificial intelligence for diagnosing and predicting survival of patients with renal cell carcinoma: Retrospective multi-center study

no code implementations12 Jan 2023 Siteng Chen, Xiyue Wang, Jun Zhang, Liren Jiang, Ning Zhang, Feng Gao, Wei Yang, Jinxi Xiang, Sen yang, Junhua Zheng, Xiao Han

The OSrisk for the prediction of 5-year survival status achieved AUC of 0. 784 (0. 746-0. 819) in the TCGA cohort, which was further verified in the independent General cohort and the CPTAC cohort, with AUC of 0. 774 (0. 723-0. 820) and 0. 702 (0. 632-0. 765), respectively.

whole slide images

Beef up mmWave Dense Cellular Networks with D2D-Assisted Cooperative Edge Caching

no code implementations2 Jan 2023 Wen Wu, Ning Zhang, Nan Cheng, Yujie Tang, Khalid Aldubaikhy, Xuemin, Shen

In this paper, we propose a device-to-device (D2D) assisted cooperative edge caching (DCEC) policy for millimeter (mmWave) dense networks, which cooperatively utilizes the cache resource of users and SBSs in proximity.

Retrieval

Performance Analysis and Enhancement of Beamforming Training in 802.11ad

no code implementations2 Jan 2023 Wen Wu, Nan Cheng, Ning Zhang, Peng Yang, Khalid Aldubaikhy, Xuemin, Shen

Since the derived BF training efficiency is an implicit function, to reveal the relationship between system parameters and BF training performance, we also derive an approximate expression of BF training efficiency.

Revisiting the Stack-Based Inverse Tone Mapping

no code implementations CVPR 2023 Ning Zhang, Yuyao Ye, Yang Zhao, Ronggang Wang

In this paper, we revisit the stack-based ITM approaches and propose a novel method to reconstruct HDR radiance from a single image, which only needs to estimate two exposure images.

Tone Mapping

SlowLiDAR: Increasing the Latency of LiDAR-Based Detection Using Adversarial Examples

1 code implementation CVPR 2023 Han Liu, Yuhao Wu, Zhiyuan Yu, Yevgeniy Vorobeychik, Ning Zhang

LiDAR-based perception is a central component of autonomous driving, playing a key role in tasks such as vehicle localization and obstacle detection.

Autonomous Driving

Cost-Effective Two-Stage Network Slicing for Edge-Cloud Orchestrated Vehicular Networks

no code implementations31 Dec 2022 Wen Wu, Kaige Qu, Peng Yang, Ning Zhang, Xuemin, Shen, Weihua Zhuang

Since the problem is NP-hard due to coupled network planning and network operation stages, we develop a Two timescAle netWork Slicing (TAWS) algorithm by collaboratively integrating reinforcement learning (RL) and optimization methods, which can jointly make network planning and operation decisions.

Reinforcement Learning (RL) Stochastic Optimization

CGOF++: Controllable 3D Face Synthesis with Conditional Generative Occupancy Fields

no code implementations23 Nov 2022 Keqiang Sun, Shangzhe Wu, Ning Zhang, Zhaoyang Huang, Quan Wang, Hongsheng Li

Capitalizing on the recent advances in image generation models, existing controllable face image synthesis methods are able to generate high-fidelity images with some levels of controllability, e. g., controlling the shapes, expressions, textures, and poses of the generated face images.

Face Generation

Prior-enhanced Temporal Action Localization using Subject-aware Spatial Attention

no code implementations10 Nov 2022 Yifan Liu, YouBao Tang, Ning Zhang, Ruei-Sung Lin, Haoqian Wang

Temporal action localization (TAL) aims to detect the boundary and identify the class of each action instance in a long untrimmed video.

Optical Flow Estimation Temporal Action Localization

Reward Delay Attacks on Deep Reinforcement Learning

1 code implementation8 Sep 2022 Anindya Sarkar, Jiarui Feng, Yevgeniy Vorobeychik, Christopher Gill, Ning Zhang

We find that this mitigation remains insufficient to ensure robustness to attacks that delay, but preserve the order, of rewards.

Deep Reinforcement Learning Q-Learning +2

PieTrack: An MOT solution based on synthetic data training and self-supervised domain adaptation

no code implementations22 Jul 2022 Yirui Wang, Shenghua He, YouBao Tang, Jingyu Chen, Honghao Zhou, Sanliang Hong, Junjie Liang, Yanxin Huang, Ning Zhang, Ruei-Sung Lin, Mei Han

In order to cope with the increasing demand for labeling data and privacy issues with human detection, synthetic data has been used as a substitute and showing promising results in human detection and tracking tasks.

Benchmarking Domain Adaptation +1

Controllable 3D Face Synthesis with Conditional Generative Occupancy Fields

no code implementations16 Jun 2022 Keqiang Sun, Shangzhe Wu, Zhaoyang Huang, Ning Zhang, Quan Wang, Hongsheng Li

Capitalizing on the recent advances in image generation models, existing controllable face image synthesis methods are able to generate high-fidelity images with some levels of controllability, e. g., controlling the shapes, expressions, textures, and poses of the generated face images.

Face Generation

FadMan: Federated Anomaly Detection across Multiple Attributed Networks

no code implementations27 May 2022 Nannan Wu, Ning Zhang, Wenjun Wang, Lixin Fan, Qiang Yang

The proposed algorithm FadMan is a vertical federated learning framework for public node aligned with many private nodes of different features, and is validated on two tasks correlated anomaly detection on multiple attributed networks and anomaly detection on an attributeless network using five real-world datasets.

Anomaly Detection Data Integration +1

Stability of China's Stock Market: Measure and Forecast by Ricci Curvature on Network

no code implementations14 Apr 2022 Xinyu Wang, Liang Zhao, Ning Zhang, Liu Feng, Haibo Lin

As far as we know, this is the first paper to apply Ricci curvature to forecast the systemic stability of domestic stock market, and our results show that Ricci curvature has good explanatory power for the market stability and can be a good indicator to judge the future risk and volatility of the domestic market.

Time Series Time Series Analysis

D3PG: Dirichlet DDPG for Task Partitioning and Offloading With Constrained Hybrid Action Space in Mobile-Edge Computing

no code implementations IEEE 2022 Laha Ale, Scott A. King, Ning Zhang, Abdul Rahman Sattar, Janahan Skandaraniyam

Mobile-edge computing (MEC) has been regarded as a promising paradigm to reduce service latency for data processing in the Internet of Things (IoT) by provisioning computing resources at the network edges.

Deep Reinforcement Learning Edge-computing +3

Mobile Wireless Rechargeable UAV Networks: Challenges and Solutions

no code implementations24 Mar 2022 Yuntao Wang, Zhou Su, Ning Zhang, Ruidong Li

Unmanned aerial vehicles (UAVs) can help facilitate cost-effective and flexible service provisioning in future smart cities.

energy trading Privacy Preserving +1

Hierarchical BERT for Medical Document Understanding

no code implementations11 Mar 2022 Ning Zhang, Maciej Jankowski

Other than the ICD code assignment, we conduct a variety of other NLU tasks on a large commercial dataset named as TrialTrove, to showcase MDBERT's strength in delivering different levels of semantics.

document understanding Sentence

Towards Tailored Models on Private AIoT Devices: Federated Direct Neural Architecture Search

no code implementations23 Feb 2022 Chunhui Zhang, Xiaoming Yuan, Qianyun Zhang, Guangxu Zhu, Lei Cheng, Ning Zhang

To further adapt to both various data distributions and different types of devices with heterogeneous embedded hardware platforms, inspired by meta-learning, a Cluster Federated Direct Neural Architecture Search (CFDNAS) framework is proposed to achieve device-aware NAS, in the sense that each device can learn a tailored deep learning model for its particular data distribution and hardware constraint.

Federated Learning Meta-Learning +1

CommerceMM: Large-Scale Commerce MultiModal Representation Learning with Omni Retrieval

no code implementations15 Feb 2022 Licheng Yu, Jun Chen, Animesh Sinha, Mengjiao MJ Wang, Hugo Chen, Tamara L. Berg, Ning Zhang

We introduce CommerceMM - a multimodal model capable of providing a diverse and granular understanding of commerce topics associated to the given piece of content (image, text, image+text), and having the capability to generalize to a wide range of tasks, including Multimodal Categorization, Image-Text Retrieval, Query-to-Product Retrieval, Image-to-Product Retrieval, etc.

Image-text Retrieval Representation Learning +1

A Knowledge-Based Decision Support System for In Vitro Fertilization Treatment

no code implementations27 Jan 2022 Xizhe Wang, Ning Zhang, Jia Wang, Jing Ni, Xinzi Sun, John Zhang, Zitao Liu, Yu Cao, Benyuan Liu

To improve the IVF success rate, we propose a knowledge-based decision support system that can provide medical advice on the treatment protocol and medication adjustment for each patient visit during IVF treatment cycle.

Retrieval

Smart Director: An Event-Driven Directing System for Live Broadcasting

no code implementations11 Jan 2022 Yingwei Pan, Yue Chen, Qian Bao, Ning Zhang, Ting Yao, Jingen Liu, Tao Mei

To our best knowledge, our system is the first end-to-end automated directing system for multi-camera sports broadcasting, completely driven by the semantic understanding of sports events.

Event Detection Highlight Detection

Joint Task Offloading and Resource Allocation for IoT Edge Computing with Sequential Task Dependency

no code implementations23 Oct 2021 Xuming An, Rongfei Fan, Han Hu, Ning Zhang, Saman Atapattu, Theodoros A. Tsiftsis

To solve this challenging problem, we decompose it as a one-dimensional search of task offloading decision problem and a non-convex optimization problem with task offloading decision given.

Edge-computing

PROVES: Establishing Image Provenance using Semantic Signatures

1 code implementation21 Oct 2021 Mingyang Xie, Manav Kulshrestha, Shaojie Wang, Jinghan Yang, Ayan Chakrabarti, Ning Zhang, Yevgeniy Vorobeychik

Modern AI tools, such as generative adversarial networks, have transformed our ability to create and modify visual data with photorealistic results.

Face Verification

Frequency-Constrained Resilient Scheduling of Microgrid: A Distributionally Robust Approach

no code implementations30 Jun 2021 Zhongda Chu, Ning Zhang, Fei Teng

In order to prevent the potential frequency instability due to the high Power Electronics (PE) penetration under an unintentional islanding event, this paper presents a novel microgrid scheduling model which explicitly models the system frequency dynamics as well as the long/short term uncertainty associated with renewable energy resources and load.

Scheduling

Enabling Long-Term Cooperation in Cross-Silo Federated Learning: A Repeated Game Perspective

no code implementations22 Jun 2021 Ning Zhang, Qian Ma, Xu Chen

We show that enforced by a punishment strategy, such a cooperative strategy is a subgame perfect Nash equilibrium (SPNE) of the infinitely repeated game, under which some clients who are free riders at the NE of the stage game choose to be (partial) contributors.

Federated Learning

Deriving Autism Spectrum Disorder Functional Networks from RS-FMRI Data using Group ICA and Dictionary Learning

1 code implementation7 Jun 2021 Xin Yang, Ning Zhang, Donglin Wang

Fourth, we generate three corresponding masks based on the 20 selected ROIs from group ICA, the 20 ROIs selected from dictionary learning, and the 40 combined ROIs selected from both.

Dictionary Learning Functional Connectivity

AI-Native Network Slicing for 6G Networks

no code implementations18 May 2021 Wen Wu, Conghao Zhou, Mushu Li, Huaqing Wu, Haibo Zhou, Ning Zhang, Xuemin, Shen, Weihua Zhuang

Then, network slicing solutions are studied to support emerging AI services by constructing AI instances and performing efficient resource management, i. e., slicing for AI.

Management

Connecting What to Say With Where to Look by Modeling Human Attention Traces

1 code implementation CVPR 2021 Zihang Meng, Licheng Yu, Ning Zhang, Tamara Berg, Babak Damavandi, Vikas Singh, Amy Bearman

Learning the grounding of each word is challenging, due to noise in the human-provided traces and the presence of words that cannot be meaningfully visually grounded.

Caption Generation Image Captioning +1

Topology Learning Aided False Data Injection Attack without Prior Topology Information

no code implementations24 Feb 2021 Martin Higgins, Jiawei Zhang, Ning Zhang, Fei Teng

False Data Injection (FDI) attacks against powersystem state estimation are a growing concern for operators. Previously, most works on FDI attacks have been performedunder the assumption of the attacker having full knowledge ofthe underlying system without clear justification.

Range decreasing group homomorphisms and holomorphic maps between generalized loop spaces

no code implementations11 Feb 2021 Ning Zhang

$\mathcal{F}^{0}(V, M)$ be the component of $\mathcal{F}(V, \mathcal{G})$ resp.

Complex Variables Functional Analysis Representation Theory 58D15, 46T25, 22E66, 58C10, 32H02

Newtonalized Orthogonal Matching Pursuit for Linear Frequency Modulated Pulse Frequency Agile Radar

no code implementations29 Jan 2021 Jiang Zhu, Honghui Guo, Ning Zhang, Chunyi Song, Zhiwei Xu

The linear frequency modulated (LFM) frequency agile radar (FAR) can synthesize a wide signal bandwidth through coherent processing while keeping the bandwidth of each pulse narrow.

Binary Black-box Evasion Attacks Against Deep Learning-based Static Malware Detectors with Adversarial Byte-Level Language Model

1 code implementation14 Dec 2020 MohammadReza Ebrahimi, Ning Zhang, James Hu, Muhammad Taqi Raza, Hsinchun Chen

Recently, deep learning-based static anti-malware detectors have achieved success in identifying unseen attacks without requiring feature engineering and dynamic analysis.

Deep Learning Feature Engineering +2

Joint Task Offloading and Resource Allocation for IoT Edge Computing with Sequential Task Dependency

no code implementations25 Nov 2020 Xuming An, Rongfei Fan, Han Hu, Ning Zhang, Saman Atapattu, Theodoros A. Tsiftsis

To solve this challenging problem, we decompose it as a one-dimensional search of task offloading decision problem and a non-convex optimization problem with task offloading decision given.

Edge-computing Information Theory Information Theory

DeepSlicing: Deep Reinforcement Learning Assisted Resource Allocation for Network Slicing

no code implementations17 Aug 2020 Qiang Liu, Tao Han, Ning Zhang, Ye Wang

Network slicing enables multiple virtual networks run on the same physical infrastructure to support various use cases in 5G and beyond.

Deep Reinforcement Learning reinforcement-learning +1

Computation Offloading in Beyond 5G Networks: A Distributed Learning Framework and Applications

no code implementations15 Jul 2020 Xianfu Chen, Celimuge Wu, Zhi Liu, Ning Zhang, Yusheng Ji

Facing the trend of merging wireless communications and multi-access edge computing (MEC), this article studies computation offloading in the beyond fifth-generation networks.

Edge-computing Reinforcement Learning (RL)

Energy and Information Management of Electric Vehicular Network: A Survey

no code implementations17 May 2020 Nan Chen, Miao Wang, Ning Zhang, Xuemin, Shen

In this paper, we provide a comprehensive survey on the deployment and management of EVN considering all three aspects of energy flow, data communication, and computation.

Management Scheduling +1

Robust Visual Object Tracking with Two-Stream Residual Convolutional Networks

no code implementations13 May 2020 Ning Zhang, Jingen Liu, Ke Wang, Dan Zeng, Tao Mei

Inspired by the human "visual tracking" capability which leverages motion cues to distinguish the target from the background, we propose a Two-Stream Residual Convolutional Network (TS-RCN) for visual tracking, which successfully exploits both appearance and motion features for model update.

Visual Object Tracking Visual Tracking +1

Sparse Oblique Decision Tree for Power System Security Rules Extraction and Embedding

no code implementations20 Apr 2020 Qingchun Hou, Ning Zhang, Daniel S. Kirschen, Ershun Du, Yaohua Cheng, Chongqing Kang

Data-driven techniques provide a promising way to identify security rules that can be embedded in economic dispatch model to keep power system operating states secure.

DeepEDN: A Deep Learning-based Image Encryption and Decryption Network for Internet of Medical Things

no code implementations12 Apr 2020 Yi Ding, Guozheng Wu, Dajiang Chen, Ning Zhang, Linpeng Gong, Mingsheng Cao, Zhiguang Qin

Specifically, in DeepEDN, the Cycle-Generative Adversarial Network (Cycle-GAN) is employed as the main learning network to transfer the medical image from its original domain into the target domain.

Generative Adversarial Network

3D Aggregated Faster R-CNN for General Lesion Detection

no code implementations29 Jan 2020 Ning Zhang, Yu Cao, Benyuan Liu, Yan Luo

This classifier branch is equipped with Feature Aggregation and Local Magnification Layers to enhance the classifier branch.

Computed Tomography (CT) Lesion Detection +2

Enjoy the Untrusted Cloud: A Secure, Scalable and Efficient SQL-like Query Framework for Outsourcing Data

no code implementations18 Dec 2019 Yaxing Chen, Qinghua Zheng, Dan Liu, Zheng Yan, Wenhai Sun, Ning Zhang, Wenjing Lou, Y. Thomas Hou

On one hand, such work lacks of supporting scalable access control over multiple data users.

Cryptography and Security Databases Distributed, Parallel, and Cluster Computing

Mini Lesions Detection on Diabetic Retinopathy Images via Large Scale CNN Features

no code implementations19 Nov 2019 Qilei Chen, Xinzi Sun, Ning Zhang, Yu Cao, Benyuan Liu

We analyze the lesion-vs-image scale carefully and propose a large-size feature pyramid network (LFPN) to preserve more image details for mini lesion instance detection.

Lesion Detection object-detection +2

Bounding Regression Errors in Data-driven Power Grid Steady-state Models

no code implementations30 Oct 2019 Yuxiao Liu, Bolun Xu, Audun Botterud, Ning Zhang, Chongqing Kang

Results identify how the bounds decrease with additional power grid physical knowledge or more training data.

regression

Fast mmwave Beam Alignment via Correlated Bandit Learning

no code implementations7 Sep 2019 Wen Wu, Nan Cheng, Ning Zhang, Peng Yang, Weihua Zhuang, Xuemin, Shen

Beam alignment (BA) is to ensure the transmitter and receiver beams are accurately aligned to establish a reliable communication link in millimeter-wave (mmwave) systems.

AFP-Net: Realtime Anchor-Free Polyp Detection in Colonoscopy

no code implementations5 Sep 2019 Dechun Wang, Ning Zhang, Xinzi Sun, Pengfei Zhang, Chenxi Zhang, Yu Cao, Benyuan Liu

Though challenging, with the great advances in object detection techniques, automated polyp detection still demonstrates a great potential in reducing the false negative rate while maintaining a high precision.

object-detection Object Detection

Dynamic Kernel Distillation for Efficient Pose Estimation in Videos

no code implementations ICCV 2019 Xuecheng Nie, Yuncheng Li, Linjie Luo, Ning Zhang, Jiashi Feng

Existing video-based human pose estimation methods extensively apply large networks onto every frame in the video to localize body joints, which suffer high computational cost and hardly meet the low-latency requirement in realistic applications.

2D Human Pose Estimation Pose Estimation

Weakly Supervised Body Part Segmentation with Pose based Part Priors

no code implementations30 Jul 2019 Zhengyuan Yang, Yuncheng Li, Linjie Yang, Ning Zhang, Jiebo Luo

The core idea is first converting the sparse weak labels such as keypoints to the initial estimate of body part masks, and then iteratively refine the part mask predictions.

Face Parsing Segmentation +1

Gridless Multisnapshot Variational Line Spectral Estimation from Coarsely Quantized Samples

no code implementations20 Jun 2019 Ning Zhang, Jiang Zhu, Zhiwei Xu

Due to the increasing demand for low power and higher sampling rates, low resolution quantization for data acquisition has drawn great attention recently.

Quantization

Finding Needles in a Moving Haystack: Prioritizing Alerts with Adversarial Reinforcement Learning

no code implementations20 Jun 2019 Liang Tong, Aron Laszka, Chao Yan, Ning Zhang, Yevgeniy Vorobeychik

We then use these in a double-oracle framework to obtain an approximate equilibrium of the game, which in turn yields a robust stochastic policy for the defender.

Intrusion Detection reinforcement-learning +2

Online Learning for Supervised Dimension Reduction

no code implementations ICLR 2019 Ning Zhang, Qiang Wu

The purpose of this paper is to propose new online learning approaches for supervised dimension reduction.

Dimensionality Reduction regression

Context-Aware Zero-Shot Recognition

1 code implementation19 Apr 2019 Ruotian Luo, Ning Zhang, Bohyung Han, Linjie Yang

We present a novel problem setting in zero-shot learning, zero-shot object recognition and detection in the context.

Object Recognition Zero-Shot Learning

Laplace Landmark Localization

no code implementations ICCV 2019 Joseph P. Robinson, Yuncheng Li, Ning Zhang, Yun Fu, and Sergey Tulyakov

Our method claims state-of-the-art on all of the 300W benchmarks and ranks second-to-best on the Annotated Facial Landmarks in the Wild (AFLW) dataset.

Ranked #5 on Face Alignment on AFLW-19 (NME_box (%, Full) metric)

Face Alignment Facial Landmark Detection

Visual Attention Model for Name Tagging in Multimodal Social Media

no code implementations ACL 2018 Di Lu, Leonardo Neves, Vitor Carvalho, Ning Zhang, Heng Ji

Everyday billions of multimodal posts containing both images and text are shared in social media sites such as Snapchat, Twitter or Instagram.

Natural Language Understanding Question Answering

Overlapping Sliced Inverse Regression for Dimension Reduction

no code implementations23 Jun 2018 Ning Zhang, Zhou Yu, Qiang Wu

The new algorithm, called overlapping sliced inverse regression (OSIR), is able to estimate the effective dimension reduction space and determine the number of effective factors more accurately.

Dimensionality Reduction regression

An Aggressive Genetic Programming Approach for Searching Neural Network Structure Under Computational Constraints

no code implementations3 Jun 2018 Zhe Li, Xuehan Xiong, Zhou Ren, Ning Zhang, Xiaoyu Wang, Tianbao Yang

In this paper, we study how to design a genetic programming approach for optimizing the structure of a CNN for a given task under limited computational resources yet without imposing strong restrictions on the search space.

Diversity Evolutionary Algorithms

Improving Robustness of ML Classifiers against Realizable Evasion Attacks Using Conserved Features

no code implementations28 Aug 2017 Liang Tong, Bo Li, Chen Hajaj, Chaowei Xiao, Ning Zhang, Yevgeniy Vorobeychik

A conventional approach to evaluate ML robustness to such attacks, as well as to design robust ML, is by considering simplified feature-space models of attacks, where the attacker changes ML features directly to effect evasion, while minimizing or constraining the magnitude of this change.

Intrusion Detection Malware Detection

Interactions of Fungi with Concrete: Significant Importance for Bio-Based Self-Healing Concrete

no code implementations4 Aug 2017 Jing Luo, Xiaobo Chen, Jada Crump, Hui Zhou, David G. Davies, Guangwen Zhou, Ning Zhang, Congrui Jin

The goal of this study is to explore a new self-healing concept in which fungi are used as a self-healing agent to promote calcium mineral precipitation to fill the cracks in concrete.

X-Ray Diffraction (XRD)

Deep Reinforcement Learning-based Image Captioning with Embedding Reward

no code implementations CVPR 2017 Zhou Ren, Xiaoyu Wang, Ning Zhang, Xutao Lv, Li-Jia Li

The policy network serves as a local guidance by providing the confidence of predicting the next word according to the current state.

Decision Making Decoder +4

AutoScaler: Scale-Attention Networks for Visual Correspondence

no code implementations17 Nov 2016 Shenlong Wang, Linjie Luo, Ning Zhang, Jia Li

We propose AutoScaler, a scale-attention network to explicitly optimize this trade-off in visual correspondence tasks.

Optical Flow Estimation

A Hierarchical Distributed Processing Framework for Big Image Data

no code implementations3 Jul 2016 Le Dong, Zhiyu Lin, Yan Liang, Ling He, Ning Zhang, Qi Chen, Xiaochun Cao, Ebroul lzquierdo

The proposed ICP framework consists of two mechanisms, i. e. SICP (Static ICP) and DICP (Dynamic ICP).

Fine-grained pose prediction, normalization, and recognition

no code implementations22 Nov 2015 Ning Zhang, Evan Shelhamer, Yang Gao, Trevor Darrell

Pose variation and subtle differences in appearance are key challenges to fine-grained classification.

General Classification Pose Prediction

Compact Bilinear Pooling

7 code implementations CVPR 2016 Yang Gao, Oscar Beijbom, Ning Zhang, Trevor Darrell

Bilinear models has been shown to achieve impressive performance on a wide range of visual tasks, such as semantic segmentation, fine grained recognition and face recognition.

Face Recognition Few-Shot Learning +3

Sparse learning of maximum likelihood model for optimization of complex loss function

no code implementations18 Nov 2015 Ning Zhang, Prathamesh Chandrasekar

In this paper, we study the problem of optimizing the complex performance measure directly to obtain a predictive model.

Image Classification Sparse Learning

Deep Domain Confusion: Maximizing for Domain Invariance

7 code implementations10 Dec 2014 Eric Tzeng, Judy Hoffman, Ning Zhang, Kate Saenko, Trevor Darrell

Recent reports suggest that a generic supervised deep CNN model trained on a large-scale dataset reduces, but does not remove, dataset bias on a standard benchmark.

Domain Adaptation Model Selection +1

Do Convnets Learn Correspondence?

no code implementations NeurIPS 2014 Jonathan Long, Ning Zhang, Trevor Darrell

Convolutional neural nets (convnets) trained from massive labeled datasets have substantially improved the state-of-the-art in image classification and object detection.

General Classification Image Classification +3

Part-based R-CNNs for Fine-grained Category Detection

no code implementations15 Jul 2014 Ning Zhang, Jeff Donahue, Ross Girshick, Trevor Darrell

Semantic part localization can facilitate fine-grained categorization by explicitly isolating subtle appearance differences associated with specific object parts.

Fine-Grained Image Classification Object +2

PANDA: Pose Aligned Networks for Deep Attribute Modeling

1 code implementation CVPR 2014 Ning Zhang, Manohar Paluri, Marc'Aurelio Ranzato, Trevor Darrell, Lubomir Bourdev

We propose a method for inferring human attributes (such as gender, hair style, clothes style, expression, action) from images of people under large variation of viewpoint, pose, appearance, articulation and occlusion.

Attribute Facial Attribute Classification +2

DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition

8 code implementations6 Oct 2013 Jeff Donahue, Yangqing Jia, Oriol Vinyals, Judy Hoffman, Ning Zhang, Eric Tzeng, Trevor Darrell

We evaluate whether features extracted from the activation of a deep convolutional network trained in a fully supervised fashion on a large, fixed set of object recognition tasks can be re-purposed to novel generic tasks.

Clustering Domain Adaptation +3

Cannot find the paper you are looking for? You can Submit a new open access paper.