Page tree
Skip to end of metadata
Go to start of metadata

The proposal period for 2022 internships is now closed
The proposal period for 2023 internships will open in November 2022

This project is already taken for the 2022 intern program

This project is available as a student work experience opportunity with HPCC Systems this summer. Curious about other projects we are offering? Take a look at our Ideas List

Find out about the HPCC Systems Summer Internship Program.


Project Description

This project will be implemented in Enterprise Control Language (ECL) on the Big Data processing platform HPCC Systems. The expected result is an anomaly detection bundle with a user-friendly interface. The student will research existing anomaly detection methods and choose a few to implement in ECL and test the performance in HPCC Systems.  Student will collaboratively identify appropriate datasets for testing, and will produce tested production code implementing the selected algorithms. The selected algorithms should be supported by a research paper or publication. A widely adopted algorithm is preferred. After 12 week's hard work, we would like to have your work summarized in a white paper. In the previous internships, many project results are turned into paper publications in different conferences and journals including top venues. Although it's not required, we suggest student aim for it.


* Free ECL online training is available for students who don't have previous ECL programming experience. It's recommended to complete the training before the internship starts for a better internship experience.


Mentor

Lili Xu
Contact Details

Backup Mentor: Roger Dev
Contact Details 

Skills needed
  • Knowledge of anomaly detection algorithm
  • Knowledge of distributed computing techniques
  • Knowledge of HPCC Systems
  • Knowledge of ECL
  • Familiar with Github
Deliverables

Midterm

  • Implementation design
  • Test datasets ready
  • Complete 70% of the implementation
  • Code check-in on Github
  • Documentation

End of project

  • Complete 100% of the implementation
  • Performance testing
  • Documentation
  • Complete code check-in and code-review on Github
  • White paper/Publication
Other resources
  • No labels