Building Intelligent Autonomous Navigation Agents

Chaplot, Devendra

doi:10.1184/R1/19552297.v1

dchaplot_phd_mld_2021.pdf (23.43 MB)

Building Intelligent Autonomous Navigation Agents

thesis

posted on 2022-04-21, 19:59 authored by Devendra ChaplotDevendra Chaplot

Breakthroughs in machine learning in the last decade have led to `digital intelligence', i.e. machine learning models capable of learning from vast amounts of labeled data to perform several digital tasks such as speech recognition,

face recognition, machine translation and so on. The goal of this thesis is to make progress towards designing algorithms capable of `physical intelligence', i.e. building intelligent autonomous navigation agents capable of learning to

perform complex navigation tasks in the physical world involving visual perception, natural language understanding, reasoning, planning, and sequential decision making. Despite several advances in classical navigation methods in

the last few decades, current navigation agents struggle at long-term semantic navigation tasks. In the first part of the thesis, we discuss our work on shortterm navigation using end-to-end reinforcement learning to tackle challenges

such as obstacle avoidance, semantic perception, language grounding, and reasoning In the second part, we present a new class of navigation methods based on modular learning and structured explicit map representations, which leverage

the strengths of both classical and end-to-end learning methods, to tackle long-term navigation tasks. We show that these methods are able to effectively tackle challenges such as localization, mapping, long-term planning, exploration

and learning semantic priors. These modular learning methods are capable of long-term spatial and semantic understanding and achieve state-of-the-art results

on various navigation tasks.

History

Date

2021-03-03

Degree Type

Dissertation

Department

Machine Learning

Degree Name

Doctor of Philosophy (PhD)

Advisor(s)

Ruslan Salakhutdinov

Usage metrics

Keywords

visual navigation exploration deep reinforcement learning

Licence

CC BY 4.0

Exports

RefWorks

BibTeX

Ref. manager

Endnote

DataCite

NLM

DC

Building Intelligent Autonomous Navigation Agents

History

Date

Degree Type

Department

Degree Name

Advisor(s)

Usage metrics

Categories

Keywords

Licence

Exports