Learning Gaussian Policies from Corrective Human Feedback (bibtex)
by Daan Wout, Jan Scholten, Carlos Celemin, and Jens Kober
Reference:
Daan Wout, Jan Scholten, Carlos Celemin, and Jens Kober. Learning Gaussian Policies from Corrective Human Feedback. arXiv:1903.05216 [cs.LG], 2019.
Bibtex Entry:
@Misc{Wout2019arXiv,
  author = {Wout, Daan and Scholten, Jan and Celemin, Carlos and Kober, Jens},
  title  = {Learning {G}aussian Policies from Corrective Human Feedback},
  year   = {2019},
  note   = {arXiv:1903.05216 [cs.LG]},
  file   = {https://arxiv.org/pdf/1903.05216.pdf},
  url    = {https://arxiv.org/abs/1903.05216},
}
Powered by bibtexbrowser