The proposal period for 2022 internships is now closed
The proposal period for 2023 internships will open in November 2022

Student work experience opportunities also exist for students who want to suggest their own project idea. Project suggestions must be relevant to HPCC Systems and of benefit to our open source community. 

Find out about the HPCC Systems Summer Internship Program.

Project Description

This project requires at least some basic knowledge of HPCC Platform and test methodology. Current HPCC Platform has regression test suite https://github.com/hpcc-systems/HPCC-Platform/tree/master/testing/regress and performance test https://github.com/hpcc-systems/PerformanceTesting on bare-metal setup. This project is to adopt these tests to cloud environment mainly focus on benchmark type of measurement of Roxie and Thor jobs in various setup, such as cloud environment, storage types, Roxie and Thor targets size, Kubernetes Node size as well  etc.

The code can be developed and tested in local Kubernetes and real measurement will be conducted primarily on Azure and optionally on AWS.

Here are some dimensions for the test:

Additional considerations about the project

A github project should be created to host all files and documentation.

Student will work closely with our build and test team.

If you are interested in this project, please contact Contact Details

Completion of this project involves:

By the mid term review we would expect you to have:

Mentor

Xiaoming Wang
Contact Details

Backup Mentor: Godson Fortil Godson.Fortil@lexisnexisrisk.com, Turlapathi, Krishna Krishna.Turlapathi@lexisnexisrisk.com
Contact Details

Skills needed
  • General Cloud Environment knowledge such as Azure, AWS and GCP, Kubernetes and Docker
  • Unix Shell, Python and PowerShell
  • Ability to write test code. Knowledge of ECL is not a requirement since it should be possible to re-use existing code with minimal changes for this purpose. Links are provided below to our ECL training documentation and online courses should you wish to become familiar with the ECL  language.
Deliverables

Midterm

  • A github project with design and initial code implementation
  • Basic setup and measurement on Azure. 

End of project

Complete github project with documenatation.

Finish measurements for Azure and AWS.

Other resources