Publications
2019
The combination of deep learning with reinforcement learning, and the application of deep learning to the sciences, are relatively new and flourishing fields. We show how deep reinforcement learning can learn to solve problems that present many possibilities but little information, often finding the most efficient solution possible, by designing an algorithm that learns to solve seven different combinatorial puzzles, including the Rubik's Cube. We then show how deep learning can be applied to the field of circadian rhythms, which are fundamental to all forms of life: using deep learning, we gain insight into circadian rhythms at the molecular level. Finally, we propose new deep learning algorithms that yield significant performance improvements on computer vision and high energy physics tasks.
2018
Recently, Approximate Policy Iteration (API) algorithms have achieved superhuman proficiency in two-player zero-sum games such as Go, Chess, and Shogi without human data. These algorithms alternate between two policies: a slow policy (tree search) and a fast policy (a neural network). In these two-player games, a reward is always received at the end of the game. The Rubik's Cube, however, has only a single solved state, and episodes are not guaranteed to terminate, which poses a major problem for API algorithms that rely on the reward received at the end of the game. We introduce Autodidactic Iteration: an API algorithm that overcomes the problem of sparse rewards by training on a distribution of states that allows the reward to propagate from the goal state to states farther away. Autodidactic Iteration learns how to solve the Rubik's Cube and the 15-puzzle without relying on human data. Our algorithm solves 100% of randomly scrambled cubes while achieving a median solve length of 30 moves, as short as or shorter than that of solvers that employ human domain knowledge.
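A minimal sketch of the training idea, not the paper's actual implementation: it substitutes the 3x3 sliding-tile (8-)puzzle for the Rubik's Cube and a lookup table for the paper's neural network, and the helper names (`moves`, `scramble`, `autodidactic_iteration`) are illustrative. Training states are sampled by scrambling backward from the goal, so one-step lookahead targets let the reward propagate outward from the solved state.

```python
import random

# Toy stand-in for the puzzles in the paper: the 3x3 sliding-tile
# (8-)puzzle. A lookup table replaces the paper's neural network,
# purely to keep the sketch self-contained.

GOAL = (1, 2, 3, 4, 5, 6, 7, 8, 0)  # 0 marks the blank tile

def moves(state):
    """Yield every state reachable by sliding one tile into the blank."""
    s = list(state)
    b = s.index(0)
    r, c = divmod(b, 3)
    for dr, dc in ((-1, 0), (1, 0), (0, -1), (0, 1)):
        nr, nc = r + dr, c + dc
        if 0 <= nr < 3 and 0 <= nc < 3:
            n = s[:]
            j = nr * 3 + nc
            n[b], n[j] = n[j], n[b]
            yield tuple(n)

def scramble(depth):
    """Sample a training state by taking `depth` random moves backward
    from the goal: the state distribution that lets the reward
    propagate from the goal state to states farther away."""
    s = GOAL
    for _ in range(depth):
        s = random.choice(list(moves(s)))
    return s

def autodidactic_iteration(n_samples=20000, max_depth=25):
    """Value updates with one-step lookahead targets: reward is 1 for
    a move that reaches the goal and -1 otherwise."""
    v = {GOAL: 0.0}
    for _ in range(n_samples):
        s = scramble(random.randint(1, max_depth))
        if s == GOAL:
            continue
        v[s] = max(
            (1.0 if c == GOAL else -1.0) + v.get(c, -float(max_depth))
            for c in moves(s)
        )
    return v

if __name__ == "__main__":
    random.seed(0)
    v = autodidactic_iteration()
    # Estimated value should fall off with scramble depth.
    for d in (1, 5, 10, 20):
        states = [scramble(d) for _ in range(200)]
        mean = sum(v.get(s, -25.0) for s in states) / len(states)
        print(f"scramble depth {d:2d}: mean estimated value {mean:7.2f}")
```

At solve time the paper pairs the trained network with tree search; the toy above only illustrates how sampling states near the goal lets value estimates propagate to states farther away.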