During the first year, effort was split between technical implementation and go-to-market initiatives. The implementation followed the planned trajectory without significant hurdles.
Main technical achievements include:
* Unstructured Type Support: Sarus now supports free-text columns, using small pre-trained language models.
* Flexibility for New Types: The Sarus transformer-based synthetic data (SD) model makes it easy to add support for new data types.
* DP-LLM-FT Module: Sarus has built a DP-LLM-FT module that fine-tunes LLMs with differential privacy (DP) guarantees without exposing the training data.
* DP-RAG Module: A DP-RAG module, which answers retrieval-augmented generation (RAG) queries with DP guarantees, was built and open-sourced.
* Backbone for Data Manipulation: The backbone has been delivered in full, including user tracking, recursive DP compilation, pushes to external tables, and performance improvements.
* Qrlew: Sarus has developed Qrlew, an open-source tool that manipulates SQL queries to ensure DP.
* Advanced Types: Handling of ranges and possible values has been improved. Types are inferred from the data and can be validated and modified by the user. These types are propagated through SQL transforms.
* Docker Support: Support for pre-validated, cross-language, Docker-based computations, which allows a computation written in any language to be encapsulated as a Docker image and composed with other Sarus operations.
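
To illustrate why known value ranges matter for DP guarantees on aggregate queries, the following is a minimal sketch of the textbook Laplace mechanism applied to a bounded sum. It does not use the Sarus or Qrlew APIs and is not their implementation: the function name `dp_bounded_sum` and its parameters are hypothetical, and it assumes each individual contributes at most one value.

```python
import random


def dp_bounded_sum(values, lower, upper, epsilon, rng=None):
    """Return a noisy sum of `values` using the Laplace mechanism.

    Hypothetical helper for illustration only. Assumes each individual
    contributes at most one value, so adding or removing one individual
    changes the clipped sum by at most max(|lower|, |upper|).
    """
    if epsilon <= 0:
        raise ValueError("epsilon must be positive")
    if lower > upper:
        raise ValueError("lower must not exceed upper")
    rng = rng or random.Random()

    # Clip every value into the declared range so the sensitivity is bounded.
    clipped = [min(max(v, lower), upper) for v in values]
    sensitivity = max(abs(lower), abs(upper))

    # Laplace(0, scale) noise, sampled as the difference of two
    # independent exponential variables with mean `scale`.
    scale = sensitivity / epsilon
    noise = rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)
    return sum(clipped) + noise


if __name__ == "__main__":
    ages = [23, 35, 41, 58, 62]
    print(dp_bounded_sum(ages, lower=0, upper=100, epsilon=1.0))
```

A tighter declared range gives a smaller sensitivity and therefore less noise for the same epsilon. This sketch uses ordinary floating-point sampling and a non-cryptographic random generator, so it is suitable for explanation only, not for production use.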