Search Results for author: C. Lee Giles

Found 57 papers, 10 papers with code

Learning CNF Blocking for Large-scale Author Name Disambiguation

no code implementations EMNLP (sdp) 2020 Kunho Kim, Athar Sefid, C. Lee Giles

Author name disambiguation (AND) algorithms identify a unique author entity record from all similar or same publication records in scholarly or similar databases.

Blocking Clustering

Acknowledgement Entity Recognition in CORD-19 Papers

1 code implementation EMNLP (sdp) 2020 Jian Wu, Pei Wang, Xin Wei, Sarah Rajtmajer, C. Lee Giles, Christopher Griffin

We built a supplementary database by linking CORD-19 papers with acknowledgement entities extracted by AckExtract including persons and organizations and find that only up to 50–60% of named entities are actually acknowledged.

Sentence

Automated Detection and Analysis of Data Practices Using A Real-World Corpus

no code implementations16 Feb 2024 Mukund Srinath, Pranav Venkit, Maria Badillo, Florian Schaub, C. Lee Giles, Shomir Wilson

Privacy policies are crucial for informing users about data practices, yet their length and complexity often deter users from reading them.

Stability Analysis of Various Symbolic Rule Extraction Methods from Recurrent Neural Network

no code implementations4 Feb 2024 Neisarg Dave, Daniel Kifer, C. Lee Giles, Ankur Mali

We sampled the datasets from $7$ Tomita and $4$ Dyck grammars and trained them on $4$ RNN cells: LSTM, GRU, O2RNN, and MIRNN.

Quantization

GPT-4 as an Effective Zero-Shot Evaluator for Scientific Figure Captions

no code implementations23 Oct 2023 Ting-Yao Hsu, Chieh-Yang Huang, Ryan Rossi, Sungchul Kim, C. Lee Giles, Ting-Hao K. Huang

We first constructed SCICAP-EVAL, a human evaluation dataset that contains human judgments for 3, 600 scientific figure captions, both original and machine-made, for 600 arXiv figures.

Document Domain Randomization for Deep Learning Document Layout Extraction

no code implementations20 May 2021 Meng Ling, Jian Chen, Torsten Möller, Petra Isenberg, Tobias Isenberg, Michael Sedlmair, Robert S. Laramee, Han-Wei Shen, Jian Wu, C. Lee Giles

We present document domain randomization (DDR), the first successful transfer of convolutional neural networks (CNNs) trained only on graphically rendered pseudo-paper pages to real-world document segmentation.

Document Layout Analysis

Recognizing and Verifying Mathematical Equations using Multiplicative Differential Neural Units

no code implementations7 Apr 2021 Ankur Mali, Alexander Ororbia, Daniel Kifer, C. Lee Giles

Two particular tasks that test this type of reasoning are (1) mathematical equation verification, which requires determining whether trigonometric and linear algebraic statements are valid identities or not, and (2) equation completion, which entails filling in a blank within an expression to make it true.

Mathematical Reasoning

Design and Analysis of a Synthetic Prediction Market using Dynamic Convex Sets

no code implementations5 Jan 2021 Nishanth Nakshatri, Arjun Menon, C. Lee Giles, Sarah Rajtmajer, Christopher Griffin

We show that under certain assumptions on the underlying geometry, the resulting synthetic prediction market can be used to arbitrarily closely approximate a binary function defined on a set of input data.

Automating Document Classification with Distant Supervision to Increase the Efficiency of Systematic Reviews

no code implementations9 Dec 2020 Xiaoxiao Li, Rabah Al-Zaidy, Amy Zhang, Stefan Baral, Le Bao, C. Lee Giles

Conclusions: In sum, the automated procedure of document classification presented here could improve both the precision and efficiency of systematic reviews, as well as facilitating live reviews, where reviews are updated regularly.

Document Classification General Classification

Modeling Updates of Scholarly Webpages Using Archived Data

no code implementations7 Dec 2020 Yasith Jayawardana, Alexander C. Nwala, Gavindya Jayawardena, Jian Wu, Sampath Jayarathna, Michael L. Nelson, C. Lee Giles

The vastness of the web imposes a prohibitive cost on building large-scale search engines with limited resources.

Large Scale Subject Category Classification of Scholarly Papers with Deep Attentive Neural Networks

no code implementations27 Jul 2020 Bharath Kandimalla, Shaurya Rohatgi, Jian Wu, C. Lee Giles

The results showed the importance of retraining word embedding models to maximize the vocabulary overlap and the effectiveness of the attention mechanism.

General Classification Sentence

A provably stable neural network Turing Machine

no code implementations5 Jun 2020 John Stogin, Ankur Mali, C. Lee Giles

We introduce a neural stack architecture, including a differentiable parametrized stack operator that approximates stack push and pop operations for suitable choices of parameters that explicitly represents a stack.

CODA-19: Using a Non-Expert Crowd to Annotate Research Aspects on 10,000+ Abstracts in the COVID-19 Open Research Dataset

1 code implementation ACL 2020 Ting-Hao 'Kenneth' Huang, Chieh-Yang Huang, Chien-Kuang Cornelia Ding, Yen-Chia Hsu, C. Lee Giles

This paper introduces CODA-19, a human-annotated dataset that codes the Background, Purpose, Method, Finding/Contribution, and Other sections of 10, 966 English abstracts in the COVID-19 Open Research Dataset.

Large-Scale Gradient-Free Deep Learning with Recursive Local Representation Alignment

no code implementations10 Feb 2020 Alexander Ororbia, Ankur Mali, Daniel Kifer, C. Lee Giles

Training deep neural networks on large-scale datasets requires significant hardware resources whose costs (even on cloud platforms) put them out of reach of smaller organizations, groups, and individuals.

Query Auto Completion for Math Formula Search

no code implementations9 Dec 2019 Shaurya Rohatgi, Wei Zhong, Richard Zanibbi, Jian Wu, C. Lee Giles

Query Auto Completion (QAC) is among the most appealing features of a web search engine.

Math

Automatic Generation of Headlines for Online Math Questions

1 code implementation27 Nov 2019 Ke Yuan, Dafang He, Zhuoren Jiang, Liangcai Gao, Zhi Tang, C. Lee Giles

Compared to conventional summarization tasks, this task has two extra and essential constraints: 1) Detailed math questions consist of text and math equations which require a unified framework to jointly model textual and mathematical information; 2) Unlike text, math equations contain semantic and structural features, and both of them should be captured together.

Math

Connecting First and Second Order Recurrent Networks with Deterministic Finite Automata

1 code implementation12 Nov 2019 Qinglong Wang, Kaixuan Zhang, Xue Liu, C. Lee Giles

We propose an approach that connects recurrent networks with different orders of hidden interaction with regular grammars of different levels of complexity.

Shapley Homology: Topological Analysis of Sample Influence for Neural Networks

no code implementations15 Oct 2019 Kaixuan Zhang, Qinglong Wang, Xue Liu, C. Lee Giles

This has motivated different research areas such as data poisoning, model improvement, and explanation of machine learning models.

BIG-bench Machine Learning Data Poisoning

The Neural State Pushdown Automata

no code implementations7 Sep 2019 Ankur Mali, Alexander Ororbia, C. Lee Giles

The NSPDA is also compared to a classical analog stack neural network pushdown automaton (NNPDA) as well as a wide array of first and second-order RNNs with and without external memory, trained using different learning algorithms.

Incremental Learning Tensor Networks

Lifelong Neural Predictive Coding: Learning Cumulatively Online without Forgetting

no code implementations25 May 2019 Alexander Ororbia, Ankur Mali, Daniel Kifer, C. Lee Giles

In lifelong learning systems based on artificial neural networks, one of the biggest obstacles is the inability to retain old knowledge as new information is encountered.

Verification of Recurrent Neural Networks Through Rule Extraction

no code implementations14 Nov 2018 Qinglong Wang, Kaixuan Zhang, Xue Liu, C. Lee Giles

The verification problem for neural networks is verifying whether a neural network will suffer from adversarial samples, or approximating the maximal allowed scale of adversarial perturbation that can be endured.

Continual Learning of Recurrent Neural Networks by Locally Aligning Distributed Representations

1 code implementation17 Oct 2018 Alexander Ororbia, Ankur Mali, C. Lee Giles, Daniel Kifer

We compare our model and learning procedure to other back-propagation through time alternatives (which also tend to be computationally expensive), including real-time recurrent learning, echo state networks, and unbiased online recurrent optimization.

Continual Learning Language Modelling +1

A Neural Temporal Model for Human Motion Prediction

1 code implementation CVPR 2019 Anand Gopalakrishnan, Ankur Mali, Dan Kifer, C. Lee Giles, Alexander G. Ororbia

We propose novel neural temporal models for predicting and synthesizing human motion, achieving state-of-the-art in modeling long-term motion trajectories while being competitive with prior work in short-term prediction and requiring significantly less computation.

Human motion prediction motion prediction +1

TextContourNet: a Flexible and Effective Framework for Improving Scene Text Detection Architecture with a Multi-task Cascade

no code implementations9 Sep 2018 Dafang He, Xiao Yang, Daniel Kifer, C. Lee Giles

We propose a novel and effective framework for this and experimentally demonstrate that: (1) A CNN that can be effectively used to extract instance-level text contour from natural images.

Scene Text Detection Text Detection

Distractor Generation for Multiple Choice Questions Using Learning to Rank

1 code implementation WS 2018 Chen Liang, Xiao Yang, Neisarg Dave, Drew Wham, Bart Pursel, C. Lee Giles

We investigate how machine learning models, specifically ranking models, can be used to select useful distractors for multiple choice questions.

BIG-bench Machine Learning Distractor Generation +3

Adversarial Training for Community Question Answer Selection Based on Multi-scale Matching

no code implementations22 Apr 2018 Xiao Yang, Miaosen Wang, Wei Wang, Madian Khabsa, Ahmed Awadallah, Daniel Kifer, C. Lee Giles

We frame this task as a binary (relevant/irrelevant) classification problem, and present an adversarial training framework to alleviate label imbalance issue.

Answer Selection General Classification

Learned Neural Iterative Decoding for Lossy Image Compression Systems

no code implementations15 Mar 2018 Alexander G. Ororbia, Ankur Mali, Jian Wu, Scott O'Connell, David Miller, C. Lee Giles

For lossy image compression systems, we develop an algorithm, iterative refinement, to improve the decoder's reconstruction compared to standard decoding techniques.

Image Compression

Conducting Credit Assignment by Aligning Local Representations

no code implementations5 Mar 2018 Alexander G. Ororbia, Ankur Mali, Daniel Kifer, C. Lee Giles

Using back-propagation and its variants to train deep networks is often problematic for new users.

A Comparative Study of Rule Extraction for Recurrent Neural Networks

no code implementations16 Jan 2018 Qinglong Wang, Kaixuan Zhang, Alexander G. Ororbia II, Xinyu Xing, Xue Liu, C. Lee Giles

Then we empirically evaluate different recurrent networks for their performance of DFA extraction on all Tomita grammars.

Learning to Adapt by Minimizing Discrepancy

no code implementations30 Nov 2017 Alexander G. Ororbia II, Patrick Haffner, David Reitter, C. Lee Giles

We investigate the viability of a more neurocognitively-grounded approach in the context of unsupervised generative modeling of sequences.

An Empirical Evaluation of Rule Extraction from Recurrent Neural Networks

no code implementations29 Sep 2017 Qinglong Wang, Kaixuan Zhang, Alexander G. Ororbia II, Xinyu Xing, Xue Liu, C. Lee Giles

Rule extraction from black-box models is critical in domains that require model validation before implementation, as can be the case in credit scoring and medical diagnosis.

Medical Diagnosis

Learning a Hierarchical Latent-Variable Model of 3D Shapes

1 code implementation17 May 2017 Shikun Liu, C. Lee Giles, Alexander G. Ororbia II

We propose the Variational Shape Learner (VSL), a generative model that learns the underlying structure of voxelized 3D shapes in an unsupervised fashion.

3D Object Classification 3D Object Recognition +3

Learning Adversary-Resistant Deep Neural Networks

no code implementations5 Dec 2016 Qinglong Wang, Wenbo Guo, Kaixuan Zhang, Alexander G. Ororbia II, Xinyu Xing, Xue Liu, C. Lee Giles

Despite the superior performance of DNNs in these applications, it has been recently shown that these models are susceptible to a particular type of attack that exploits a fundamental flaw in their design.

Autonomous Vehicles

Using Non-invertible Data Transformations to Build Adversarial-Robust Neural Networks

no code implementations6 Oct 2016 Qinglong Wang, Wenbo Guo, Alexander G. Ororbia II, Xinyu Xing, Lin Lin, C. Lee Giles, Xue Liu, Peng Liu, Gang Xiong

Deep neural networks have proven to be quite effective in a wide variety of machine learning tasks, ranging from improved speech recognition systems to advancing the development of autonomous vehicles.

Autonomous Vehicles Dimensionality Reduction +2

Adversary Resistant Deep Neural Networks with an Application to Malware Detection

no code implementations5 Oct 2016 Qinglong Wang, Wenbo Guo, Kaixuan Zhang, Alexander G. Ororbia II, Xinyu Xing, C. Lee Giles, Xue Liu

However, after a thorough analysis of the fundamental flaw in DNNs, we discover that the effectiveness of current defenses is limited and, more importantly, cannot provide theoretical guarantees as to their robustness against adversarial sampled-based attacks.

Information Retrieval Malware Detection +3

Unifying Adversarial Training Algorithms with Flexible Deep Data Gradient Regularization

no code implementations26 Jan 2016 Alexander G. Ororbia II, C. Lee Giles, Daniel Kifer

Many previous proposals for adversarial training of deep neural nets have included di- rectly modifying the gradient, training on a mix of original and adversarial examples, using contractive penalties, and approximately optimizing constrained adversarial ob- jective functions.

Online Semi-Supervised Learning with Deep Hybrid Boltzmann Machines and Denoising Autoencoders

no code implementations22 Nov 2015 Alexander G. Ororbia II, C. Lee Giles, David Reitter

Two novel deep hybrid architectures, the Deep Hybrid Boltzmann Machine and the Deep Hybrid Denoising Auto-encoder, are proposed for handling semi-supervised learning problems.

Denoising

A neural probabilistic model for context based citation recommendation

no code implementations AAAI 2015 Wenyi Huang, Zhaohui Wu, Chen Liang, Prasenjit Mitra, C. Lee Giles

It is not always easy for knowledgeable researchers to give an accurate citation context for a cited paper or to find the right paper to cite given context.

Citation Recommendation

Cannot find the paper you are looking for? You can Submit a new open access paper.