Page tree
Skip to end of metadata
Go to start of metadata

The projects listed here are available as student work experience opportunities with HPCC Systems this summer. Here's the list of available in alphabetical order. Some have multiple projects associated with them. You can also view the list by project type

Find out more about the HPCC Systems Summer Internship Program.
Deadline for proposals - Monday April 3rd 2017

  1. Additional Embedded Languages in ECL
    Scala, Haskell, Clojure, SAS, MatLab, MongoDB, Postgres, MariaDB or suggest one!
  2. Additional external data stores
    Ceph, S3 or suggest one!
  3. Analysing workunit performance
    Identify the most useful workunit statistics, analyse them and present them to users as visualizations within ECL Watch
  4. Cluster Deployment with Juju Charm
    Convert our current implementation to use the new Charm Helpers framework (python) and add support for new HPCC components 
  5. DFU Spray from zip/gzip files
    Create a plugin for spraying from a ZIP/GZIP archive without decompressing the content
  6. Implement a global sort and distribution optimiser
    Optimize the use of sorts and distributions by looking at the entire graph and tracking which sorts and distributions are actually used by downstream activities.
  7. Implement an IOT pluggable protocol for ROXIE
    Add support for pluggable protocols currently being used in IOT projects
  8. Provide Unicode implementations for HPCC Systems standard library functions
    Improvement the way HPCC Systems handles unstructured text
  9. Log Visualisation Tool
    Create visualizations of the top counts for specific types of issues within a log file, showing severity and details
  10. Machine Learning Algorithms on the HPCC Platform
    Approximate n-tile, Gradient Trees, Text Search Bundle, Documentation Generator for ECL Code
  11. Implement a Jupyter kernel for HPCC/ECL
    Implement a kernel to enable the embedding/execution of source code and displaying the results
  12. MPI Proof of Concept
    Replace existing socket-based message passing api with an open-source MPI 
  13. Text Search Bundle
  14. Port Roxie to a different UDP layer
  15. Investigate how well the current implementation achieves the goal of getting data from slaves to serves as quickly and reliably as possible on today's systems, and investigate whether there is a third party library or alternative protocol that may be worth consideration.
  16. Continuous Integration of roxie query / data deployments using Jenkins
  17. System self health check
    Design and implement a tool to provide an overall check to everything is working as expected across components, from a button within ECL Watch
  18. Provide SELinux Policies for the HPCC-Platform installation on Linux environments
  19. Build SELinux domains for hpccsystems-platform services.



  • No labels