It's 'basically a made-up language'. It's just tongue-in-cheek, because when we started this it was a ridiculous proposition to try to make a DSL.
Boundary (YC W23) | Software engineer (compilers) | Seattle, USA (in person) | Full-time
We are building a new programming language (BAML) to build AI agents -- the "TypeScript" for LLMs. We are open source: https://github.com/BoundaryML/baml
A big part of this language is all the tooling around visualizing non-deterministic code and getting great observability (e.g. our language has type information at runtime, unlike TS).
We are looking for engineers with experience with Rust, programming languages, and/or compilers. Any amount of experience is fine.
To apply: send an email to aaron@boundaryml.com with your resume and mention you came from HN
You may also want to check out BAML https://github.com/BoundaryML/baml - a DSL for prompt templates that are literally treated like functions.
The prompt.yaml format (which this project uses) suffers from the fact that it doesn't address the structured-outputs problem. Writing schemas in YAML/XML is insanely painful. But BAML just feels like writing TypeScript types.
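For illustration, a minimal sketch of what a BAML schema looks like (the field names and the client name here are made up for the example, not taken from any real project):

```baml
// Hypothetical output schema -- fields are illustrative.
// Declaring types reads much like TypeScript: string[], int?, etc.
class Resume {
  name string
  skills string[]
  years_experience int?
}

// An LLM call declared as a typed function; the client name is an assumption.
function ExtractResume(resume_text: string) -> Resume {
  client GPT4
  prompt #"
    Extract the resume fields from: {{ resume_text }}
    {{ ctx.output_format }}
  "#
}
```

The point of the comparison: the schema is a type declaration, not an inline YAML/XML blob embedded in a prompt file.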
- Created by an AWS team, but the AWS logo is barely visible at the bottom.
- Actually cute logo and branding.
- Puts the lead devs front and center (which HN loves). Makes it seem less like a corporation and more like two devs working on their own project, or an actual startup.
- The comment tone of "hey, I've been working on this for a year" also makes it seem as if there weren't ten 6-pagers written to make it happen (maybe there weren't?).
- Flashy landing page.
Props to the team. I wish there were more projects like this branching out of AWS. E.g. Lightsail should've been launched like this.
We're making a prompting DSL (BAML https://github.com/BoundaryML/baml), and what we've found is that all the syntax rules can easily be encoded into a Cursor Rules file, which LLMs follow nicely. DSLs are simple by nature, so there aren't too many rules to define.
Here's the cursor rules file we give folks: gist.github.com/aaronvg/b4f590f59b13dcfd79721239128ec208
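To give a flavor of the approach (this is an illustrative sketch, not the contents of that gist), such a rules file is mostly a short list of syntax reminders:

```text
# BAML syntax rules (illustrative sketch)
- Output schemas are `class` blocks; each field is `name type`, e.g. `name string`.
- Optional fields use `?` (e.g. `int?`); lists use `[]` (e.g. `string[]`).
- LLM calls are declared as `function Name(arg: type) -> ReturnType { ... }`
  with a `client` and a `prompt #"..."#` block inside.
- Never invent fields outside the declared class when writing prompts.
```

Because the whole grammar fits in a handful of bullets like these, the model rarely drifts from valid syntax.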
It's kind of wild -- none of the multimillion-dollar VSCode forks (Cursor, Windsurf) are working properly at the moment. It seems Open VSX is quite a vulnerable single point of failure: searching extensions gives a 503.
It's kind of insane going from 76% to 3% on the new version of a benchmark. We clearly need more rapid progress on the creation of benchmarks.
Then again, I wonder: if a benchmark is way too hard from the beginning, does that make it much harder for people to test new solutions that actually have real-world impact, even if those solutions only raise the score on the hard benchmark by 1%?
I'll add it in!