The proposal period for 2022 internships is now closed
The proposal period for 2023 internships will open in November 2022

This is new project, more information coming soon. If you are interested in this project contact Lorraine Chapman

Find out about the HPCC Systems Summer Internship Program.

Project Description

In order to eventually create digital human readers in Spanish, a dictionary must be established. This project will use the Spanish dictionary from Wiktionary. One interesting aspect of this project are the verbs in Spanish which have a rich morphology.

If you are interested in this project, please contact Add email link to mentor.

Completion of this project involves:

By the mid term review we would expect you to have:

Mentor

David de Hilster
david.dehilster@lexisnexisrisk.com

Backup Mentor: Add Backup Mentor Name
Add link to Email Address 

Skills needed
  • Keen interest in natural language
  • Ability to learn and program in NLP++
  • Ability to create test cases
  • Ability to write test code in ECL using the NLP++ plugin to test the enhanced dictionary
Deliverables

Midterm

  • Parts-of-speech text files

End of project

  • A Spanish dictionary repository in the VisualText open source github including the dictionary files and  NLP++ analyzers
Other resources