Page tree

Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Image Added

Atreya Bain is studying for a Bachelor of Computer Science and Engineering at the RV College of Engineering, Bengaluru, India.

Atreya joined the HPCC Systems Intern Program in 2021 to provide some improvements to the HSQL (HPCC Systems Structured Query Language) project. Atreya was a member of the team who created the HSQL project in 2020, as part of an academic collaboration between HPCC Systems and RVCE, under the supervision of Dr Shobha G and Arjuna Chala (Senior Director, Operations, LexisNexis Risk Solutions Group). Atreya's internship involved implementing the following improvements:

  • Define an initial syntax set for HSQL
  • Provide a working compiler that can convert HSQL to ECL
  • Provide a VSCode extension for use with HSQL

As well as the resources included here, read Atreya's intern blog journal which includes a more in depth look of his work during his 2021 internship. To learn more about the initial work carried out on this project, view the poster Atreya entered into our 2020 Poster Contest

Poster Abstract

Big Data has become an important field, and there is a steep learning curve to getting used to handling Big Data, especially in distributed systems. HSQL for HPCC Systems is a solution that is developed for allowing users to get used to its architecture and the ECL (Enterprise Control Language) language with which it primarily operates. HSQL aims to provide a seamless interface for data science developers to use, for working with data. It is designed to work in conjunction with ECL, the primary programming language for HPCC Systems, and should prove to be easy to work with and robust for general purpose analysis.

HSQL is made to provide a compact and easy to comprehend SQL-like syntax for performing visualizations, general exploratory data analysis, training of Machine Learning models while also allowing a modular structure to such programs. Functions can also be written to allow for code reuse. It can also integrate with VSCode IDE and provide Syntax Highlighting and Code Completion features.

In previous work on HSQL, the primary foundations were set and in this work, various improvements were made to make it more usable and correct as a compiler. The architecture of the compiler has received changes that allow it to translate more effectively and the newer version of the compiler brings in support for functions, for code reusability, and modules that help structure code. Additionally, a lot of the existing statements have received new features that make them easier and better to use.

Presentation

In this Video Recording, Atreya provides a tour and explanation of his poster content.

Improvements on HSQL: A SQL-like language for HPCC Systems

Click on the poster for a larger image.