Cool, so are you actually using an LLM? If so, is it yours or are you borrowing someone else's? (You mentioned recent improvements in LLMs being a catalyst that made now the right time to tackle this.)
If not, I'd definitely like to hear more about your specific AI model.
Yes, we are using an LLM for some parts of the code generation, specifically GPT-4. In the medium term, we plan to go lower in the stack and build our own AI model. We broke the process down into modular steps so that we leverage LLMs only where they're most needed, and use rule-based methods in the other parts of the process (e.g. fixing compilation errors). This maximizes the accuracy of the transformations.
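To make the modular idea concrete, here is a minimal sketch of that kind of hybrid pipeline: an LLM-backed step handles the open-ended translation, and deterministic rule-based steps handle predictable fixes. All function names and the toy "translation" logic are illustrative assumptions, not the actual system.

```python
from typing import Callable, List

def llm_translate(source: str) -> str:
    """Stand-in for the LLM-backed translation step (e.g. a GPT-4 API call)."""
    # Placeholder: a real implementation would prompt an LLM here.
    return source.replace("PRINT", "print")  # toy "translation"

def rule_based_fixes(code: str) -> str:
    """Deterministic cleanup pass, e.g. fixing a known compilation issue."""
    if not code.endswith("\n"):
        code += "\n"  # rule: ensure a trailing newline
    return code

def pipeline(source: str, steps: List[Callable[[str], str]]) -> str:
    """Run the source through each modular step in order."""
    for step in steps:
        source = step(source)
    return source

result = pipeline('PRINT("hello")', [llm_translate, rule_based_fixes])
print(result)
```

Keeping each step a plain function makes it easy to swap an LLM call for a rule-based pass (or vice versa) without touching the rest of the pipeline.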
Yes, internally we have separate models that produce tests the final output has to pass before being presented to the user. In addition, you can define your own tests on the platform, and we will ensure the transformations produced pass those tests before deployment. We also have helpful versioning and backtesting features.
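The deployment gate described above can be sketched as follows: a candidate transformation is deployed only if every user-defined test passes. The function and test names here are hypothetical, not the platform's actual API.

```python
from typing import Callable, List

def deploy_if_passing(candidate: str,
                      tests: List[Callable[[str], bool]]) -> bool:
    """Approve the candidate transformation only if every test passes."""
    if all(test(candidate) for test in tests):
        return True   # a real system would trigger deployment here
    return False      # rejected: fails at least one user-defined test

# Example user-defined tests on the generated output
tests = [
    lambda out: "eval(" not in out,  # reject unsafe constructs
    lambda out: out.strip() != "",   # reject empty output
]
print(deploy_if_passing('print("ok")', tests))
```

Gating on `all(...)` means a single failing test blocks deployment, which matches the guarantee that transformations must pass every test before going live.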