
Adapting a Robot’s Linguistic Style Based on Socially-Aware Reinforcement Learning
Hi everyone! Today we are back for a paper on understanding human interest in a discussion! This article is about creating a robot whose...

Enriching Word Vectors with Subword Information
Hi there ! I will try to make a summary of an article as always, but this time i'll be giving my opinion afterwards. This article,...

One-Shot Imitation Learning
I am doing research for a projet of mine and was greatly interested in this article from OpenAI and UC Berkeley. It talks about one-shot...

Policy Gradients (Actor-Critic methods and DDPG) - The Recent Evolution of Reinforcement Learning p3
Today, we'll continue our talk about policy gradients. I want to insist on the fact that there is an enormous amount of research...

Policy Gradients (PG, DPG) - The Recent Evolution of Reinforcement Learning p2
We are going to talk about one of the essential parts of reinforcement learning, policy gradients. First we need to explicit what is a...

DeepMind's Deep Q-network - The Recent Evolution of Reinforcement Learning p1
Before finishing the set of articles about Improving Language Understanding with Unsupervised Learning, I wanted to talk about what I...

Understanding "Improving Language Understanding with Unsupervised Learning" Part 2
I decided to try and understand one of OpenAI's latest research (from last June). But to do so, we must first read and assimilate the...

Understanding "Improving Language Understanding with Unsupervised Learning" Part 1
I decided to try and understand one of OpenAI's latest research (from last June). But to do so, we must first read and assimilate the...

Discovering Reinforcement Learning with Ms. PacMan
The most fascinating domain in the already fascinating machine learning sector is in my opinion, Reinforcement Learning (RL). It became...









