
浏览量:53
Timothy P. Lillicrap
Centre for Neuroscience Studies, Queen's University, Kingston, Ontario, Canada, K7L 3N6
Login to view more

论文共 67 篇
An investigation of model-free planning.
Dendritic solutions to the credit assignment problem
Meta-Learning Neural Bloom Filters.
Composing Entropic Policies using Divergence Correction.
Noise Contrastive Priors for Functional Uncertainty.
Deep reinforcement learning with relational inductive biases.
Is coding a relevant metaphor for building AI?
Fast Parametric Learning with Activation Memorization.
Recall Traces: Backtracking Models for Efficient Reinforcement Learning.
Distributed Distributional Deterministic Policy Gradients.
Measuring abstract reasoning in neural networks.
Relational recurrent neural networks.
Measuring abstract reasoning in neural networks.
Relational Deep Reinforcement Learning.
Optimizing Agent Behavior over Long Time Scales by Transporting Value.
Experience Replay for Continual Learning.
Learning Attractor Dynamics for Generative Memory.
Vector-based navigation using grid-like representations in artificial agents.
Vector-based navigation using grid-like representations in artificial agents.
Optimizing Agent Behavior over Long Time Scales by Transporting Value.
Deep Learning with Dynamic Spiking Neurons and Fixed Feedback Weights.
Learning to Learn without Gradient Descent by Gradient Descent.
Deep reinforcement learning for robotic manipulation with asynchronous off-policy updates.
Mastering the game of Go with deep neural networks and tree search.
Asynchronous Methods for Deep Reinforcement Learning.
Matching Networks for One Shot Learning.
Meta-Learning with Memory-Augmented Neural Networks.
Scaling Memory-Augmented Neural Networks with Sparse Reads and Writes.
Multielectrode Arrays Single-Unit Stability Using Chronically Implanted
AreaVersus Limb Trajectory in Dorsal Premotor Preferential Representation of Instructed Target
Nonhuman Primates Kinematics and Kinetics of Multijoint Reaching in
and During Reaching Movement Control of Hand Impedance Under Static Conditions
Continuous control with deep reinforcement learning
Learning Continuous Control Policies by Stochastic Value Gradients
An alternative to explicit divisive normalization models
Complex Spatiotemporal Tuning in Human Upper-Limb Muscles
Why copy others? Insights from the social learning tournament
Relevance Realization and the Emerging Framework in Cognitive Science
Temporal Evolution of "Automatic Gain-Scaling"
Learning sensitivity derivative by implicit supervision
Unsupervised learning is crucial to learning the names of objects