Recent advances in reinforcement learning : 9th European Workshop, EWRL 2011, Athens, Greece, September 9-11, 2011, Revised selected papers, Scott Sanner, Marcus Hutter (eds.), (electronic book)

Label
Recent advances in reinforcement learning : 9th European Workshop, EWRL 2011, Athens, Greece, September 9-11, 2011, Revised selected papers
Title
Recent advances in reinforcement learning
Title remainder
9th European Workshop, EWRL 2011, Athens, Greece, September 9-11, 2011, Revised selected papers
Statement of responsibility
Scott Sanner, Marcus Hutter (eds.)
Title variation
EWRL 2011
Language
eng
Cataloging source
GW5XE
Dewey number
006.3/1
Index
index present
LC call number
Q325.6
LC item number
.E97 2011
Literary form
non fiction
http://bibfra.me/vocab/lite/meetingDate
2011
http://bibfra.me/vocab/lite/meetingName
EWRL 2011
Nature of contents
  • dictionaries
  • bibliography
http://library.link/vocab/relatedWorkOrContributorName
  • Sanner, Scott
  • Hutter, Marcus
Series statement
  • Lecture notes in artificial intelligence
  • Lecture notes in computer science
  • LNCS sublibrary. SL 7, Artificial intelligence
Series volume
7188
http://library.link/vocab/subjectName
Reinforcement learning
Label
Recent advances in reinforcement learning : 9th European Workshop, EWRL 2011, Athens, Greece, September 9-11, 2011, Revised selected papers, Scott Sanner, Marcus Hutter (eds.), (electronic book)
Antecedent source
unknown
Bibliography note
Includes bibliographical references and author index
Color
multicolored
Contents
  • Automatic Discovery of Ranking Formulas for Playing with Multi-armed Bandits
  • Francis Maes, Louis Wehenkel and Damien Ernst
  • Goal-Directed Online Learning of Predictive Models
  • Sylvie C. W. Ong, Yuri Grinberg and Joelle Pineau
  • Gradient Based Algorithms with Loss Functions and Kernels for Improved On-Policy Control
  • Matthew Robards and Peter Sunehag
  • Active Learning of MDP Models
  • Mauricio Araya-López, Olivier Buffet, Vincent Thomas and François Charpillet
  • Handling Ambiguous Effects in Action Learning
  • Boris Lesner and Bruno Zanuttini
  • Invited Talk: UCRL and Autonomous Exploration
  • Peter Auer
  • Feature Reinforcement Learning in Practice
  • Phuong Nguyen, Peter Sunehag and Marcus Hutter
  • Reinforcement Learning with a Bilinear Q Function
  • Charles Elkan
  • l1-Penalized Projected Bellman Residual
  • Matthieu Geist and Bruno Scherrer
  • Invited Talk: Increasing Representational Power and Scaling Inference in Reinforcement Learning
  • Kristian Kersting
  • Invited Talk: PRISM - Practical RL: Representation, Interaction, Synthesis, and Mortality
  • Peter Stone
  • Invited Talk: Towards Robust Reinforcement Learning Algorithms
  • Csaba Szepesvári
  • Unified Inter and Intra Options Learning Using Policy Gradient Methods
  • Kfir Y. Levy and Nahum Shimkin
  • Options with Exceptions
  • Munu Sairamesh and Balaraman Ravindran
  • Robust Bayesian Reinforcement Learning through Tight Lower Bounds
  • Christos Dimitrakakis
  • Optimized Look-ahead Tree Search Policies
  • Francis Maes, Louis Wehenkel and Damien Ernst
  • A Framework for Computing Bounds for the Return of a Policy
  • Cosmin Păduraru, Doina Precup and Joelle Pineau
  • Regularized Least Squares Temporal Difference Learning with Nested l2 and l1 Penalization
  • Matthew W. Hoffman, Alessandro Lazaric, Mohammad Ghavamzadeh and Rémi Munos
  • Transferring Evolved Reservoir Features in Reinforcement Learning Tasks
  • Kyriakos C. Chatzidimitriou, Ioannis Partalas, Pericles A. Mitkas and Ioannis Vlahavas
  • Recursive Least-Squares Learning with Eligibility Traces
  • Bruno Scherrer and Matthieu Geist
  • Value Function Approximation through Sparse Bayesian Modeling
  • Nikolaos Tziortziotis and Konstantinos Blekas
  • Automatic Construction of Temporally Extended Actions for MDPs Using Bisimulation Metrics
  • Pablo Samuel Castro and Doina Precup
  • Bayesian Multitask Inverse Reinforcement Learning
  • Christos Dimitrakakis and Constantin A. Rothkopf
  • Batch, Off-Policy and Model-Free Apprenticeship Learning
  • Edouard Klein, Matthieu Geist and Olivier Pietquin
  • Introduction of Fixed Mode States into Online Profit Sharing and Its Application to Waist Trajectory Generation of Biped Robot
  • Seiya Kuroda, Kazuteru Miyazaki and Hiroaki Kobayashi
  • MapReduce for Parallel Reinforcement Learning
  • Yuxi Li and Dale Schuurmans
  • Compound Reinforcement Learning: Theory and an Application to Finance
  • Tohgoroh Matsui, Takashi Goto, Kiyoshi Izumi and Yu Chen
  • Transfer Learning via Multiple Inter-task Mappings
  • Anestis Fachantidis, Ioannis Partalas, Matthew E. Taylor and Ioannis Vlahavas
  • Proposal and Evaluation of the Active Course Classification Support System with Exploitation-Oriented Learning
  • Kazuteru Miyazaki and Masaaki Ida
  • Multi-Task Reinforcement Learning: Shaping and Feature Selection
  • Matthijs Snel and Shimon Whiteson
  • Transfer Learning in Multi-Agent Reinforcement Learning Domains
  • Georgios Boutsioukis, Ioannis Partalas and Ioannis Vlahavas
  • An Extension of a Hierarchical Reinforcement Learning Algorithm for Multiagent Settings
  • Ioannis Lambrou, Vassilis Vassiliades and Chris Christodoulou
Control code
SPR794223060
Dimensions
unknown
Extent
1 online resource (xiii, 344 p.)
File format
unknown
Form of item
online
Isbn
9783642299469
Level of compression
unknown
Quality assurance targets
not applicable
Reformatting quality
unknown
Reproduction note
Electronic resource.
Sound
unknown sound
Specific material designation
remote
