The projects listed here are available as student work experience opportunities with HPCC Systems as part of our summer intern program and Google Summer of Code.
Find out more about the HPCC Systems Summer Intern Program.
- Additional Embedded Languages in ECL
Clojure, Haskell, MariaDB, MATLAB, MongoDB, ODBC, Postgres, SAS, Scala, SQL, or suggest one!
- Additional external data stores
Ceph, S3, or suggest one!
- Cluster Deployment with Juju Charm
Convert our current implementation to use the new Charm Helpers framework (Python) and add support for new HPCC components
- Create an HPCC Systems VM on Hyper-V
More coming soon
- DFU Spray from zip/gzip files
Create a plugin for spraying from a ZIP/GZIP archive without decompressing the content
- Implement an IoT pluggable protocol for ROXIE
Add support for pluggable protocols currently being used in IoT projects
- Machine Learning Algorithms on the HPCC Platform
Anomaly Detection, Data Series Classification, Detecting fraud within stored-value cards, Implement an approximate n-tile algorithm, Text Search Bundle TBC, Word Vectorization
- Port Roxie to a different UDP layer
Investigate how well the current implementation achieves the goal of getting data from slaves to the server as quickly and reliably as possible on today's systems, and investigate whether a third-party library or alternative protocol may be worth considering.
- System self health check
Design and implement a tool that performs an overall check that everything is working as expected across components, triggered from a button within ECL Watch
- Provide SELinux Policies for the HPCC-Platform installation on Linux environments
Build SELinux domains for hpccsystems-platform services.
- Locking engine to replace DALI - Investigative project
Research, test, and build a proof of concept (POC) of a third-party inter-machine/process locking engine, for example ZooKeeper, HashiCorp's Consul, or another suitable contender.
- Replace existing socket-based message passing interface with an open source package
Explore whether a different messaging layer (an open-source package such as ZeroMQ) offers improved performance, robustness, and code maintainability
These projects are new for 2018. They are still under development and more details will be added soon. To find out more about any of these projects, view the associated JIRA issue and contact Lorraine Chapman or the project mentor:
- Implement ECL Pretty Print
- Implement reference dafilesrv in other languages
- Implement a Reverse activity
- Incorporating self test code into a bundle
- Provide test code for bundles with no self test
- VS Code extension for DESDL and other languages
- Add Arrow support to dafilesrv
- Add ORC support to HPCC Systems
- Using HPCC Systems as a data lake for the Deep Cloud platform
- Applying HPCC Systems Word Vectors to SEC Filings