MACHINE LEARNING MODEL FOR CLASSIFYING THE FRENCH VERB POUVOIR

First page of the conference article

Project description

The Modal Verbs Modality Detector project focuses on creating a machine learning model to identify and analyze modal verbs' usage in text. This endeavor is rooted in Natural Language Processing and involves a systematic approach to gather data, train the model, and evaluate its performance.

The project's primary objective is to build a reliable model that can detect modal verbs and determine their modality, which includes understanding their function in expressing necessity, possibility, permission, or obligation. The methodology begins with data collection, where a diverse set of texts is compiled to ensure the model can generalize well across different contexts. This corpus is meticulously annotated, marking the modal verbs and their respective modalities.

The team utilizes advanced NLP techniques and leverages various machine learning algorithms to train the model. The training process involves feature extraction, where linguistic features relevant to modality are identified and used to train the model. These features might include the position of the modal verb in a sentence, its surrounding words, and syntactic structures.

A significant part of the project is dedicated to fine-tuning the model to achieve high accuracy. The team experiments with different algorithms, adjusts hyperparameters, and employs cross-validation techniques to prevent overfitting. The performance of the model is rigorously evaluated using metrics such as precision, recall, and F1-score to ensure it accurately detects modal verbs and their modality in unseen texts.

The results of the project are promising, indicating that the model can effectively identify modal verbs and classify their modality with a high degree of accuracy. The team acknowledges the challenges encountered, such as dealing with ambiguous cases where the context heavily influences the interpretation of modality. However, the solutions and strategies developed during the project provide valuable insights for future research in this area.

The culmination of this work is presented in an article that is available in the project repository under the name COLLI_ROSSINI_BATTISTELLI. This article details the entire process, from data collection and annotation to model training and evaluation. It serves as a comprehensive guide for understanding the intricacies of building a modality detector and offers a foundation for further advancements in NLP and modality detection.

Discover more about this project and click on the button below to access the GitHub Repository.

Find out more on this project

Explore More Projects

If you're interested in exploring more projects, please select another project from the dropdown menu.