Menu principal

Commonsense Reasoning For Question Answering

Commonsense is a skill every human has but that is hard to get for computers. A simple observation can convince us: When we write a text, we rarely state the obvious, what is commonsense. For example, we will rarely say that at night, the sun is not visible!

We can divide the problem of commonsense into two parts. First, there is commonsense knowledge, i.e. statements that we intuitively know are true. For example, the fact that elephants have a trunk. It is opposed to encyclopedic knowledge that is acquired by studying. For example, we learn that Paris is the capital of France at school. The second part is commonsense reasoning, i.e. using reasoning over commonsense knowledge. This kind of reasoning is particularly useful when it comes to question-answering. For example, to the question ``where would I not want a fox?'', I could answer a hen house as foxes eat hens and hens are found in hen houses.

The goal of this project is to study the limitations of the current approaches. In particular, we will be interested in the CommonsenseQA dataset. State-of-the-art algorithms rely on a knowledge base called ConceptNet. This is a problem for several reasons:

* CommonsenseQA is partially built from ConceptNet, which biased the results.
* It is not clear if the approaches would generalize to other knowledge bases.
* They rely on a clear path between the question and the answer in the knowledge graph.

In the first part of this internship, we will study the existing methods such as MHGRN or QA-GNN. We will compare them by changing the knowledge base they use to see how they generalize. Then, we will see how we can leverage the weaknesses to propose a new approach.


Mots-clés
Analyse de données; apprentissage automatique; artificial intelligence
Établissement
Institut Mines Telecom - Telecom SudParis
91120 Palaiseau  
Site Web
http://www.madics.fr/wp-content/uploads/offresEmplois/202201170857_internship_commonsense_reasoning.pdf
Date de début souhaitée
01/03/2022
Langues obligatoires
Anglais; Français
Niveau
Bac +5
Prérequis

* English (French can be useful for daily life).
* Good knowledge of Python.
* Experience with machine learning and deep learning, in particular with frameworks like Pytorch.
* Basic knowledge about knowledge bases/ontologies.

Durée
6 mois
Indemnité
591 € par mois
Date limite
01/06/2022
Informations de contact

Julien Romero, julien.romero@telecom-sudparis.eu