Vladimir Steiner

Jun 29, 2020 · 3 min read

Efficient and Scalable Bayesian Neural Nets with Rank-1 Factor

Hey everyone, new blog post today, so let's get right to it! This article was written by Michael Dusenberry, Ghassen Jerfel, Yeming Wen,...

Vladimir Steiner

Apr 29, 2020 · 3 min read

Adapting a Robot’s Linguistic Style Based on Socially-Aware Reinforcement Learning

Hi everyone! Today we are back with a paper on understanding human interest in a discussion! This article is about creating a robot whose...

Vladimir Steiner

Mar 20, 2020 · 3 min read

Enriching Word Vectors with Subword Information

Hi there! As always, I will try to summarize an article, but this time I'll be giving my opinion afterwards. This article,...

Vladimir Steiner

Jan 22, 2020 · 3 min read

One-Shot Imitation Learning

I am doing research for a project of mine and was greatly interested in this article from OpenAI and UC Berkeley. It talks about one-shot...

Vladimir Steiner

Mar 1, 2019 · 3 min read

Policy Gradients (Actor-Critic methods and DDPG) - The Recent Evolution of Reinforcement Learning p3

Today, we'll continue our talk about policy gradients. I want to stress that there is an enormous amount of research...

Vladimir Steiner

Feb 27, 2019 · 3 min read

Policy Gradients (PG, DPG) - The Recent Evolution of Reinforcement Learning p2

We are going to talk about one of the essential parts of reinforcement learning: policy gradients. First, we need to spell out what a...

Vladimir Steiner

Feb 22, 2019 · 3 min read

DeepMind's Deep Q-network - The Recent Evolution of Reinforcement Learning p1

Before finishing the set of articles about Improving Language Understanding with Unsupervised Learning, I wanted to talk about what I...

Vladimir Steiner

Dec 17, 2018 · 3 min read

Understanding "Improving Language Understanding with Unsupervised Learning" Part 2

I decided to try to understand one of OpenAI's latest research papers (from last June). But to do so, we must first read and assimilate the...

Vladimir Steiner

Nov 26, 2018 · 3 min read

Understanding "Improving Language Understanding with Unsupervised Learning" Part 1

I decided to try to understand one of OpenAI's latest research papers (from last June). But to do so, we must first read and assimilate the...

Vladimir Steiner

Nov 5, 2018 · 3 min read

Discovering Reinforcement Learning with Ms. PacMan

The most fascinating domain in the already fascinating field of machine learning is, in my opinion, reinforcement learning (RL). It became...
