The proposal period for 2022 internships is now open
Submit your final proposal to Lorraine Chapman before Friday 18th March 2022
Browse the list below. Once you have selected a project, we recommend that you contact the mentor to discuss your ideas. Full details of how the application process works are available in this blog. Also, see our Proposal Guidelines and an example to help you prepare your proposal submission. If you are new to HPCC Systems, find out more here and view our tips on how to prepare for your project and internship.
The projects listed here are available as student work experience opportunities with HPCC Systems as part of our summer intern program. Submissions are welcome from high school through to PhD. Find out about our Class of 2021 and their projects.
Contact Lorraine Chapman with your questions about the program.
- Additional Embedded Languages in ECL
Clojure, Haskell, Julia, MariaDB, MatLab, MongoDB (Under Offer), ODBC, Postgres, SAS, Scala, SQL, or suggest one!
- Additional external data stores
Ceph or suggest one!
- Address Cleaner Plugin Optimizations
- Build Process Improvements - Ninja, Jenkins X and Azure
- Cloud specific projects
HPCC Cluster Storage Support With Container Storage Interface (CSI)
Baseline test suite for an HPCC Systems cluster on Kubernetes
Explore Kubernetes Federation Support
HPCC Systems Kubernetes Cloud Namespace hierarchy
HPCC Systems machine learning cluster on Kubernetes
Enhance K8S Ingress with Istio Gateway and API Gateway
Istio and other Service Mesh with HPCC Systems cluster
Jenkins X
Performance test suite for an HPCC Systems cluster on Kubernetes
- Develop an Automated ECL Watch Test Suite
- Marketing/Documentation Projects
Technology/Marketing and Branding
Documentation - Data Patterns
These projects are suitable for students who would like to get involved with the HPCC Systems Open Source Project by contributing in a way that does not involve coding in a programming language.
- ECL Code Documentation Generator Improvements - No longer available
Make major improvements to the ECL Code Documentation Generator (ECLDoc), written in Python.
- Investigate Test Frameworks and Best Practises for HPCC Systems Cloud Native
- Investigate Third Party Environments Working with HPCC Systems
- Locking engine to replace DALI - Investigative project
Research, test and do a POC of a 3rd party inter-machine/process locking engine, for example ZooKeeper, HashiCorp's Consul or other suitable contenders.
- Machine Learning Algorithms on the HPCC Platform
Anomaly Detection Algorithms - No longer available
Applying the Causality Toolkit to Real World Datasets
- Natural Language Processing
Enhance the English Dictionary
Enhance the full English phrase parser
Enhance the performance of the ECL NLP Plugin
Create an NLP dictionary for Portuguese
Create an NLP dictionary for Chinese
Create an NLP dictionary for Spanish
Create an NLP dictionary for the Kurdish language - Under Offer
Build knowledge base for the human body
Build an OCR cleanup analyzer
Create a phrase parser for Portuguese (Follow-on project - Requires the completion of the Portuguese dictionary project)
Create a phrase parser for Chinese (Follow-on project - Requires the completion of the Chinese dictionary project)
Create a phrase parser for Spanish (Follow-on project - Requires the completion of the Spanish dictionary project)
Sentiment Analysis in Chinese
Sentiment Analysis in English
Sentiment Analysis in Portuguese
Sentiment Analysis in Spanish
Sentiment Analysis in the Kurdish Language
- Performance Testing - Bare Metal vs Cloud Native
- Provide SELinux Policies for the HPCC-Platform installation on Linux environments
Build SELinux domains for hpccsystems-platform services.
- Replace existing socket-based message passing interface with an open source package
Explore whether using a different message layer (an open-source package, such as ZeroMQ) offers improved performance, robustness and code maintainability.
- System self health check - Under Review. Please get in contact if you are interested in the project
Design and implement a tool that provides an overall check that everything is working as expected across components, from a button within ECL Watch.
These projects are still under development and more details will be added soon. If you want to know more about any of these projects, view the associated JIRA issue and contact Lorraine Chapman or the mentor of the project: