This project is already taken and is not available for the 2023 HPCC Systems Intern Program.

Find out about the HPCC Systems Summer Internship Program.

Project Description

Create a knowledge base (KB) for the human body using NLP++ and its conceptual grammar. The KB includes a "part-of" hierarchy for the human body and an accompanying vocabulary (dictionary), so that NLP++ analyzers can use it when processing medical texts.
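To make the idea of a "part-of" hierarchy with an accompanying vocabulary concrete, here is a minimal sketch in plain Python. It is not NLP++ and does not use the NLP++ KB API. The `Concept` class, the `build_vocabulary` helper, and the sample body parts and synonyms are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Concept:
    """One node in a hypothetical part-of hierarchy."""
    name: str
    synonyms: list = field(default_factory=list)
    parts: list = field(default_factory=list)

    def add_part(self, name, synonyms=()):
        """Attach a child concept that is a part of this concept."""
        child = Concept(name, list(synonyms))
        self.parts.append(child)
        return child


def build_vocabulary(concept, path=()):
    """Map each lowercase term (name or synonym) to its part-of path."""
    here = path + (concept.name,)
    vocab = {concept.name.lower(): here}
    for synonym in concept.synonyms:
        vocab[synonym.lower()] = here
    for part in concept.parts:
        vocab.update(build_vocabulary(part, here))
    return vocab


# Illustrative data only; a real KB would be built from researched medical sources.
body = Concept("body")
arm = body.add_part("arm", ["upper limb"])
hand = arm.add_part("hand")
hand.add_part("finger", ["digit"])

vocab = build_vocabulary(body)
print(vocab["digit"])  # the part-of path from the body down to the finger concept
```

Looking up a word found in a text returns the chain of enclosing body parts, which is the kind of information a part-of KB is meant to supply to a text analyzer.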

If you are interested in this project, please contact David Dehilster.

Completion of this project involves:

  • Becoming familiar with NLP++ and VisualText
  • Researching and finding medical data that can be used to create a KB and dictionary
  • Implementing NLP++ parsers that ingest this data and create a KB and dictionary for the human body
  • Running the KB and dictionary on medical texts using the NLP++ ECL plugin
  • Creating a VisualText repository containing the human body KB and dictionary

By the midterm review, we would expect you to have:

  • <What must be completed to pass the evaluation and continue on to complete the project>
Mentor

David Dehilster

Skills needed
  • Keen interest in natural language
  • Ability to do research on the internet
  • Ability to learn and program in NLP++
  • Ability to write test code in ECL using the NLP++ plugin to test the KB and dictionary
Deliverables

Midterm

  • More details coming soon

End of project

  • More details coming soon
Other resources