Search Results for author: Charles Siegel

Found 10 papers, 5 papers with code

Recombination of Artificial Neural Networks

no code implementations • 12 Jan 2019 • Aaron Vose, Jacob Balma, Alex Heye, Alessandro Rigazzi, Charles Siegel, Diana Moise, Benjamin Robbins, Rangan Sukumar

We propose a genetic algorithm (GA) for hyperparameter optimization of artificial neural networks which includes chromosomal crossover as well as a decoupling of parameters (i. e., weights and biases) from hyperparameters (e. g., learning rate, weight decay, and dropout) during sexual reproduction.

Hyperparameter Optimization

Paper
Add Code

Multimodal Deep Neural Networks using Both Engineered and Learned Representations for Biodegradability Prediction

no code implementations • 13 Aug 2018 • Garrett B. Goh, Khushmeen Sakloth, Charles Siegel, Abhinav Vishnu, Jim Pfaendtner

Deep learning algorithms excel at extracting patterns from raw data, and with large datasets, they have been very successful in computer vision and natural language applications.

Feature Engineering Representation Learning

Paper
Add Code

ColdRoute: Effective Routing of Cold Questions in Stack Exchange Sites

1 code implementation • 2 Jul 2018 • Jiankai Sun, Abhinav Vishnu, Aniket Chakrabarti, Charles Siegel, Srinivasan Parthasarathy

Using data from eight stack exchange sites, we are able to improve upon the routing metrics (Precision$@1$, Accuracy, MRR) over the state-of-the-art models such as semantic matching by $159. 5\%$,$31. 84\%$, and $40. 36\%$ for cold questions posted by existing askers, and $123. 1\%$, $27. 03\%$, and $34. 81\%$ for cold questions posted by new askers respectively.

Paper
Code

GossipGraD: Scalable Deep Learning using Gossip Communication based Asynchronous Gradient Descent

no code implementations • 15 Mar 2018 • Jeff Daily, Abhinav Vishnu, Charles Siegel, Thomas Warfel, Vinay Amatya

In this paper, we present GossipGraD - a gossip communication protocol based Stochastic Gradient Descent (SGD) algorithm for scaling Deep Learning (DL) algorithms on large-scale systems.

Paper
Add Code

Using Rule-Based Labels for Weak Supervised Learning: A ChemNet for Transferable Chemical Property Prediction

1 code implementation • 7 Dec 2017 • Garrett B. Goh, Charles Siegel, Abhinav Vishnu, Nathan O. Hodas

With access to large datasets, deep neural networks (DNN) have achieved human-level accuracy in image and speech recognition tasks.

Property Prediction speech-recognition +2

Paper
Code

SMILES2Vec: An Interpretable General-Purpose Deep Neural Network for Predicting Chemical Properties

4 code implementations • 6 Dec 2017 • Garrett B. Goh, Nathan O. Hodas, Charles Siegel, Abhinav Vishnu

Chemical databases store information in text representations, and the SMILES format is a universal standard used in many cheminformatics software.

Bayesian Optimization Feature Engineering

Paper
Code

How Much Chemistry Does a Deep Neural Network Need to Know to Make Accurate Predictions?

2 code implementations • 5 Oct 2017 • Garrett B. Goh, Charles Siegel, Abhinav Vishnu, Nathan O. Hodas, Nathan Baker

The meteoric rise of deep learning models in computer vision research, having achieved human-level accuracy in image recognition tasks is firm evidence of the impact of representation learning of deep neural networks.

Representation Learning

Paper
Code

Chemception: A Deep Neural Network with Minimal Chemistry Knowledge Matches the Performance of Expert-developed QSAR/QSPR Models

2 code implementations • 20 Jun 2017 • Garrett B. Goh, Charles Siegel, Abhinav Vishnu, Nathan O. Hodas, Nathan Baker

We then show how Chemception can serve as a general-purpose neural network architecture for predicting toxicity, activity, and solvation properties when trained on a modest database of 600 to 40, 000 compounds.

Feature Engineering Image Classification +2

Paper
Code

Adaptive Neuron Apoptosis for Accelerating Deep Learning on Large Scale Systems

no code implementations • 3 Oct 2016 • Charles Siegel, Jeff Daily, Abhinav Vishnu

We present novel techniques to accelerate the convergence of Deep Learning algorithms by conducting low overhead removal of redundant neurons -- apoptosis of neurons -- which do not contribute to model learning, during the training phase itself.

General Classification

Paper
Add Code

Distributed TensorFlow with MPI

no code implementations • 7 Mar 2016 • Abhinav Vishnu, Charles Siegel, Jeffrey Daily

Machine Learning and Data Mining (MLDM) algorithms are becoming increasingly important in analyzing large volume of data generated by simulations, experiments and mobile devices.

Distributed, Parallel, and Cluster Computing

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.