
Datacurve
ActiveFrontier coding data for training and evaluating LLMs
About
We generate expert quality coding data at scale for fine-tuning LLMs
Founders
Serena Ge· FounderStarted building software in high school - built a climbing training app with Team Canada athletes. Studied at Waterloo CS for a year then dropped out. Worked with the Cohere CTO on LLM reasoning and coding capabilities through synthetic data. Went to YC W24, pivoted 3 times until Datacurve. Now scaling high quality coding data production pipelines at Datacurve to enable next generation coding models
Charley Lee· FounderHacking on things since middle school. Went to Waterloo CS, interned at Google, then dove into AI research on multi-modal RL and training browser-use agents. Went through YC W24, pivoted a few times, and landed on Datacurve – now providing the data infrastructure for frontier LLMs.
Product launches · 1 launch
Providing code data by the best engineers, so you can build the most capable model