Vladimir SteinerJun 29, 20203 min readEfficient and Scalable Bayesian Neural Nets with Rank-1 FactorHey everyone, new blog post today, let's get right to it ! This article was written by Michael Dusenberry, Ghassen Jerfel, Yeming Wen,...
Vladimir SteinerApr 29, 20203 min readAdapting a Robot’s Linguistic Style Based on Socially-Aware Reinforcement LearningHi everyone! Today we are back for a paper on understanding human interest in a discussion! This article is about creating a robot whose...
Vladimir SteinerMar 20, 20203 min readEnriching Word Vectors with Subword InformationHi there ! I will try to make a summary of an article as always, but this time i'll be giving my opinion afterwards. This article,...
Vladimir SteinerJan 22, 20203 min readOne-Shot Imitation LearningI am doing research for a projet of mine and was greatly interested in this article from OpenAI and UC Berkeley. It talks about one-shot...
Vladimir SteinerMar 1, 20193 min readPolicy Gradients (Actor-Critic methods and DDPG) - The Recent Evolution of Reinforcement Learning p3Today, we'll continue our talk about policy gradients. I want to insist on the fact that there is an enormous amount of research...
Vladimir SteinerFeb 27, 20193 min readPolicy Gradients (PG, DPG) - The Recent Evolution of Reinforcement Learning p2We are going to talk about one of the essential parts of reinforcement learning, policy gradients. First we need to explicit what is a...
Vladimir SteinerFeb 22, 20193 min readDeepMind's Deep Q-network - The Recent Evolution of Reinforcement Learning p1Before finishing the set of articles about Improving Language Understanding with Unsupervised Learning, I wanted to talk about what I...
Vladimir SteinerDec 17, 20183 min readUnderstanding "Improving Language Understanding with Unsupervised Learning" Part 2I decided to try and understand one of OpenAI's latest research (from last June). But to do so, we must first read and assimilate the...
Vladimir SteinerNov 26, 20183 min readUnderstanding "Improving Language Understanding with Unsupervised Learning" Part 1I decided to try and understand one of OpenAI's latest research (from last June). But to do so, we must first read and assimilate the...
Vladimir SteinerNov 5, 20183 min readDiscovering Reinforcement Learning with Ms. PacManThe most fascinating domain in the already fascinating machine learning sector is in my opinion, Reinforcement Learning (RL). It became...