The proposal period for 2022 internships is now closed
The proposal period for 2023 internships will open in November 2022

This is a new project; more information is coming soon. If you are interested in this project, contact Lorraine Chapman. This is a follow-on project that requires the corresponding NLP dictionary to be completed first.

Find out about the HPCC Systems Summer Internship Program.

Project Description

Once a Portuguese dictionary has been created, a phrase parser can be implemented. A phrase parser takes Portuguese text as input and parses it linguistically into noun phrases, verb phrases, and other phrase types.
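
To make the idea concrete, here is a minimal sketch of phrase chunking in Python. It is not NLP++ and not the project's intended design: the tiny dictionary, the tag names, and the function names (tag, chunk, parse_phrases) are all hypothetical, chosen only to illustrate how dictionary lookups can feed a grouping step that produces noun phrases and verb phrases.

```python
# Toy part-of-speech dictionary (hypothetical entries, for illustration only).
# A real project would draw on the completed Portuguese NLP dictionary instead.
TOY_DICTIONARY = {
    "o": "DET", "a": "DET", "um": "DET", "uma": "DET",
    "gato": "NOUN", "menina": "NOUN", "livro": "NOUN",
    "preto": "ADJ", "novo": "ADJ",
    "come": "VERB", "lê": "VERB",
    "rapidamente": "ADV",
}

# Assumed mapping from part of speech to phrase type (a simplification).
PHRASE_FOR_POS = {
    "DET": "NP", "NOUN": "NP", "ADJ": "NP",
    "VERB": "VP", "ADV": "VP",
}


def tag(text):
    """Look up each whitespace-separated word; unknown words get 'UNK'."""
    return [(word, TOY_DICTIONARY.get(word.lower(), "UNK")) for word in text.split()]


def chunk(tagged):
    """Group adjacent tokens that map to the same phrase type."""
    phrases = []
    for word, pos in tagged:
        label = PHRASE_FOR_POS.get(pos, "OTHER")
        if phrases and phrases[-1][0] == label:
            phrases[-1][1].append(word)  # extend the current phrase
        else:
            phrases.append((label, [word]))  # start a new phrase
    return [(label, " ".join(words)) for label, words in phrases]


def parse_phrases(text):
    """Tag the text with the toy dictionary, then chunk it into phrases."""
    return chunk(tag(text))


print(parse_phrases("O gato preto come rapidamente"))
# [('NP', 'O gato preto'), ('VP', 'come rapidamente')]
```

This sketch merges any two adjacent noun phrases into one and ignores ambiguity (for example, "a" can also be a preposition in Portuguese), which is exactly the kind of problem a real parsing strategy would need to address.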

If you are interested in this project, please contact the mentor, David de Hilster (david.dehilster@lexisnexisrisk.com).

Completion of this project involves:

  • Becoming familiar with the Portuguese dictionary
  • Becoming familiar with NLP++ and VisualText
  • Coming up with a strategy for parsing phrases in Portuguese
  • Implementing the phrase parser in NLP++
  • Running the phrase parser using the NLP++ ECL Plugin
  • Creating an NLP++ repository for the full Portuguese Parser

By the mid term review we would expect you to have:

  • A design for parsing Portuguese into phrases

Mentor

David de Hilster
david.dehilster@lexisnexisrisk.com

Backup Mentor: TBA

Skills needed
  • Keen interest in natural language
  • Ability to do research on the internet
  • Ability to learn and program in NLP++
  • Ability to write test code in ECL using the NLP++ plugin to test the phrase parser
Deliverables

Midterm

  • Proposal on how the phrase parser is to be constructed

End of project

  • A Portuguese parser repository in the VisualText open-source GitHub
Other resources