Show HN: Mastra – Open-source JS agent framework, by the developers of Gatsby

(github.com)

442 points | by calcsam 393 days ago

39 comments

Palmik 392 days ago
The example from the landing page does not exactly spark joy:
```
    testWorkflow
     .step(llm)
       .then(decider)
       .then(agentOne)
       .then(workflow)
     .after(decider)
       .then(agentTwo)
       .then(workflow)
      .commit();
```
On a first glance, this looks like a very awkward way to represent the graph from the picture. And this is just a simple "workflow" (the structure of the graph does not depend on the results of the execution), not an agent.
[-]
- campers 392 days ago
  I get the same feeing when I first looked at the LangChain documentation when I wanted to first start tinkering with LLM apps.
  I built my own TypeScript AI platform https://typedai.dev with an extensive feature list where I've kept iterating on what I find the most ergonomic way to develop, using standard constructs as much as possible. I've coded enough Java streams, RxJS chains, and JavaScript callbacks and Promise chains to know what kind of code I like to read and debug.
  I was having a peek at xstate but after I came across https://docs.dbos.dev/ here recently I'm pretty sure that's that path I'll go down for durable execution to keep building everything with a simple programming model.
  [-]
  - nwienert 392 days ago
    Kind of similar camp, I checked LangChain and others and ultimately I was like, well, it's not really doing much is it, just adding abstraction on top of what is essentially basic loops and conditional statements, and tbh it feels like in nearly every case I'll never be using them the same way such that some abstraction will help over just making some function helpers myself.
    I don't think from first principles there's any broad framework that makes sense to be honest. I'll reach for a specific vector DB, or logging library, but beyond that you'll never convince me your "query-builder" API is going to make me build a better thing when I have the full power of TypeScript already.
    Especially when these products start throwing in proprietary features and add-ons with fancy names on top.
  - jumski 392 days ago
    TypedAI looks solid, was not aware of it! Bookmarked for further research.
    Personally I am not fond of the decorator approach and decided to not use it in pgflow (my soon-to-be-released workflow orchestration engine on top of Postgres).
    1. I wanted it to be simple to reason about and explicit (being more verbose as a trade-off)
    2. There are some issues with supporting decorators (Svelte https://github.com/sveltejs/svelte/issues/11502, and a lot of others).
    3. I decided to only support directed acyclic graphs (no loops!) in order to promote simplicity. Will be supporting conditional recursive sub-workflows to provide a way to repeat some steps and be able to branch.
    Cheers!
  - CMCDragonkai 392 days ago
    Can dbos work with CF durable objects?
- calcsam 392 days ago
  Thanks! The conditional `when` clauses live on the steps, rather than being represented in the workflow, and in fact when we built this for an example, the last step being called depended on the results of the previous two steps.
  How would you simplify this?
  [-]
  - anentropic 392 days ago
    I think the problem is that a 'fluent' chain of calls already expresses a sequence, so the way that 'after' resets the context to start a new branch feels very awkward ... like a GOTO or something
    It's telling that the example relies on arbitrary indentation (which a linter will get rid of) to have some hope of comprehending it
    Possibly this was all motivated by a desire to avoid nested structures above all?
    But for a branching graph a nested structure is more natural. It'd also probably be nicer if the methods were on the task nodes instead of on the workflow, then you could avoid the 'step'/'then' distinction and have something like:
    e.g.
```
    testWorkflow(
        llm
        .then(decider)
        .then(
            agentOne.then(workflow),
            agentTwo.then(workflow),
        )
    )
```
    [-]
    - calcsam 392 days ago
      You’re right that the syntax was inspired by the desire to avoid nested structures. But the syntax here is interesting as well and fairly readable. Worth thinking about!
      [-]
      - anentropic 391 days ago
        that example syntax is loosely based on CDK code for AWS Step Functions, since I had to write some recently
        essentially you're building a DAG so it could be worth checking some other APIs which do a similar thing for inspiration
        e.g. it looks like in Airflow you could write it as:
        chain(llm, decider, [agentOne, agentTwo], workflow)
        https://airflow.apache.org/docs/apache-airflow/stable/core-c...
  - jumski 392 days ago
    I think it is just easier to comprehend if the edges/dependencies are explicit (as an array for example).
    [-]
    - calcsam 392 days ago
      We have a ticket to allow this actually!
- ranjanprj 390 days ago
  Same frustration with frameworks like Langchain and Llama_Index let me to build a simple UI base Agentic freamwork that runs locally. https://github.com/ranjanprj/agentollama
- jumski 392 days ago
  Yeah, I also found this a bit unintuitive at first. I’m building a workflow engine myself (https://pgflow.dev/pgflow, not released yet), and I’ve been thinking a lot about how to model the DSL for the graph and decided to make dependencies explicit and use method chaining for expansion with other step types.
  Here’s how it would look like in my system:
```
  new Flow<string>()  
    .step("llm", llmStepHandler)  
    .step("decider", ["llm"], deciderStepHandler)  
    .step("agentOne", ["decider"], agentOneStepHandler)  
    .step("agentTwo", ["decider"], agentTwoStepHandler)  
    .step("workflow", ["agentOne", "agentTwo"], workflowStepHandler);  
```
  Mine is a DAG, so more constrained than the cyclic graph Mastra supports (if I understand correctly).
- zeroq 392 days ago
  I knew it will be bad when I seen "by the developers of Gatsby", but this is pure comedy.
  JQuery plugin for LLM.
kylemathews 393 days ago
Very excited about Mastra! We have a number of Agent-ic things we'll be building at ElectricSQL and Mastra looks like a breath of fresh air.
Also the team is top-notch — Sam was my co-founder at Gatsby and I worked closely with Shane and Abhi and I have a ton of confidence in their product & engineering abilities.
[-]
- cpursley 393 days ago
  Why not use Elixir for agents as Electric is already heavily invested? It’s a much better fit than JS.
  [-]
  - mvf4z7 393 days ago
    Gretchen, stop trying to make Elixir happen.
  - funerr 393 days ago
    I think it is actually a solid choice given the startup ecosystem and generally easy async nature.
- doctorpangloss 392 days ago
  Abhi is one of the best engineers I know. I’m excited that he and his colleagues are tackling this problem.
joshstrange 393 days ago
This looks awesome! Quick question, are there plans to support SSE MCP servers? I see Stdio [0] are supported and I can always run a proxy but SSE would be awesome.
[0] https://mastra.ai/docs/reference/tools/client
[-]
- nilslice 393 days ago
  we have a tutorial that covers this!
  https://docs.mcp.run/tutorials/mcpx-mastra-ts
  you don't even need to use SSE, as mcp.run brings the tools directly to your agent, in-process, as secure wasm modules.
  mcp.run does have SSE support for all its servlet tools in the registry though too.
- tybaa 393 days ago
  Added support in this PR https://github.com/mastra-ai/mastra/pull/1957! Isn't shipped just yet but will be soon
- tybaa 393 days ago
  Hey! Glad to hear you're excited about it! Yes, we're currently working on improving our MCP support in general - we'll have more to share soon, but part of that is supporting SSE servers directly
  [-]
  - joshstrange 393 days ago
    Very cool. Like I said I can make it work with Stdio but I have a SSE MCP proxy I wrote to combine multiple MCP servers (just to make plugging in all my tools to a new client easier to test). That said, I think after looking at the docs that I'll be tempted to move my tools in directly but I probably will keep them behind MCP for portability.
    [-]
    - tybaa 393 days ago
      Oh nice, did you write your own proxy or are you using something like https://www.npmjs.com/package/mcp-proxy ?
      [-]
      - joshstrange 393 days ago
        I have used `mcp-proxy` but (afaik) you can only use it 1-to-1 and I wanted an N-to-1 proxy so that instead of configuring all my MCP servers in the multiple clients I've tested out I could just add 1 server and pull in everything.
        I found `mcp-proxy-server` [0] which seemed like it would do what I want but I ran into multiple problems. I added some minor debug logging to it and the ball sort of rolled downhill from there. Now it's more my code than what was there originally but I have tool proxying working for multiple clients (respecting sessionIds, etc) and I think I've solved most all the issues I've run into and added features like optional tool prefixing so there isn't overlap between MCP servers.
        Given what I know now, I don't think N-to-1 is quite as useful as I thought. Or rather, it really depends on your "client". If you can toggle on/off tools in your client then it's not a big problem but sometimes you don't want "all" the tools and if you client only allows toggling per MCP server then you will have an issue.
        I love the ideas of workflows and how you have defined agents. I think my current issue is almost too many tools and the LLM sometimes gets confused over which ones to use. I'm especially thrilled with your HTTP endpoints you expose for the agents. My main MCP server (my custom tools I wrote, vs the third-party ones) exposes an HTTP GUI for calling the tools (faster iteration vs trying it through LLMs) and I've been using that and 3rd-party chat clients (LibreChat and OpenWebUI) as my "LLM testing" platform (because I wasn't aware of a better options) but neither of those tools let you "re-expose" the agents via an API.
        All in all I'm coming to the conclusion that 90% of MCP servers out there are really cool for seeing what's possible but it's probably best to write your own tools/MCP since most all MCP servers are just thin wrappers around an API. Also it's so easy to create an MCP server that they are popping up all over the place and often of low quality (don't fully implement the API, take shortcuts for the authors use-case, etc). Using LLMs to writing the "glue" code from API->Tool is fairly minor and I think is worth "owning". To sum that all up: I think my usage of 3rd party MCP servers is going to trend towards 0 as I "assimilate" MCP servers into my own codebase for more control but I really like MCP as a way to vend tools to various different LLM clients/tools.
        [0] https://github.com/adamwattis/mcp-proxy-server
        [-]
        tybaa 393 days ago
        Thanks for sharing! It's so helpful to hear real world experiences like this. Would you be interested in meeting up on a call sometime? I'd love to chat about how you're using MCP to help inform how we can make all of this easier for folks. We're actively thinking about our APIs for tool use and MCP right now.
        [-]
        joshstrange 393 days ago
        I appreciate the offer but I think you'll probably find someone better to talk to here in the comments.
        MCP is super cool and I've loved playing with it but playing with it is all I'm doing. I'm working on some tools to use in my $dayJob and also just using it as an excuse to learn about LLMs and play with new tech. Most my work is writing tools that connect our to our distributed fleet of servers to collect data, run commands, etc. My goal is to build a SlackOps-type bot that can provide extra context about errors we get in Slack (Pull the latest commits/PRs around that code, link to current deployed version, provide all the logs for the request that threw an error, check system stats, etc). And while I have tools written to do all of that I'm still working on bringing it all together in something more than a bot I can invoke from Slack and make MCP calls.
        All that to say, I'm not a professional user of MCP/Mastra and my opinion is probably not one you want shaping your framework.
        [-]
        tybaa 393 days ago
        No worries! But I am definitely interested in chatting still - that you've tried it in multiple ways, ran into pain points, and overcame those in your own ways is super interesting and valuable. Playing around is how everyone starts and this "agents with tool use in prod" game is still very new. These APIs should work well and make sense for folks who are just getting into it as well folks who have been around the block. If you change your mind let me know! Would love to chat
alanwells 393 days ago
Happy Mastra user here! Strikes the right balance between letting me build with higher level abstractions but providing lower level controls when needed. I looked at a handful of other frameworks before getting started and the clarity & easy of use of Mastra stood out. Nice work.
[-]
- calcsam 393 days ago
  thank you!
brap 393 days ago
I don’t really understand agents. I just don’t get why we need to pretend we have multiple personalities, especially when they’re all using the same model.
Can anyone please give me a usecase, that couldn’t be solved with a single API call to a modern LLM (capable of multi-step planning/reasoning) and a proper prompt?
Or is this really just about building the prompt, and giving the LLM closer guidance by splitting into multiple calls?
I’m specifically not asking about function calling.
[-]
- coffeemug 393 days ago
  If you ignore the word "agent" and autocomplete it in your mind to "step", things will make more sense.
  Here is an example-- I highlight physical books as I read them with a red pen. Sometimes my highlights are underlines, sometimes I bracket relevant text. I also write some comments in the margins.
  I want to photograph relevant pages and get the highlights and my comments into plain text. If I send an image of a highlighted/commented page to ChatGPT and ask to get everything into plain text, it doesn't work. It's just not smart enough to do it in one prompt. So, you have to do it in steps. First you ask for the comments. Then for underlined highlights. Then for bracketed highlights. Then you merge the output. Empirically, this produces much better results. (This is a really simple example; but imagine you add summarization or something, then the steps feed into each other)
  As these things get complicated, you start bumping into repeated problems (like understanding what's happening between each step, tweaking prompts, etc.) Having a library with some nice tooling can help with those. It's not especially magical and nothing you couldn't do yourself. But you also could write Datadog or Splunk yourself. It's just convenient not to.
  The internet decided to call these types of programs agents, which confuses engineers like you (and me) who tend to think concretely. But if you get past that word, and maybe write an example app or something, I promise these things will make sense.
  [-]
  - fryz 393 days ago
    To add some color to this
    Anthropic does a good job of breaking down some common architecture around using these components [1] (good outline of this if you prefer video [2]).
    "Agent" is definitely an overloaded term - the best framing of this I've seen is aligns more closely with the Anthropic definition. Specifically, an "agent" is a GenAI system that dynamically identifies the tasks ("steps" from the parent comment) without having to be instructed that those are the steps. There are obvious parallels to the reasoning capabilities that we've seen released in the latest cut of the foundation models.
    So for example, the "Agent" would first build a plan for how to address the query, dynamically farm out the steps in that plan to other LLM calls, and then evaluate execution for correctness/success.
    [1] https://www.anthropic.com/research/building-effective-agents [2] https://www.youtube.com/watch?v=pGdZ2SnrKFU
    [-]
    - eric-burel 393 days ago
      This sums up as ranging from multiple LLM calls to build a smart features to letting the LLM decide what to do next. I think you can go very far with the former but the latter is more autonompus in unconstrained environments (like chatting with a human etc.)
- bravura 393 days ago
  https://aider.chat/2024/09/26/architect.html
  "Aider now has experimental support for using two models to complete each coding task:
  An Architect model is asked to describe how to solve the coding problem.
  An Editor model is given the Architect’s solution and asked to produce specific code editing instructions to apply those changes to existing source files.
  Splitting up “code reasoning” and “code editing” in this manner has produced SOTA results on aider’s code editing benchmark. Using o1-preview as the Architect with either DeepSeek or o1-mini as the Editor produced the SOTA score of 85%. Using the Architect/Editor approach also significantly improved the benchmark scores of many models, compared to their previous “solo” baseline scores (striped bars)."
  In particular, recent discord chat suggests that o3m is the most effective architect and Claude Sonnet is the most effective code editor.
  [-]
  - hassleblad23 392 days ago
    Now next is to have a Senior Editor and Editor pair :)
- weego 393 days ago
  I don't get it either. Watching implementations on YouTube etc it primarily it feels like a load of verbiage trying to carve out a sub-industry, but the meat on the bone just seems to be defining discreet units of AI actions that can be chained into workflows that interact with non-ai services.
  [-]
  - jacobr1 393 days ago
    > defining discreet units of AI actions that can be chained into workflows that interact with non-ai services.
    You got. But that is the interesting part! To make AI useful, beyond basic content generation in a chat context you need interaction with the outside world. And you may need iterative workflows that can spawn more work based on the output of those interactions. The focus on Agents as personas is a tangent to the core use case. We could just call this stuff "AI Workflow Orchestration" or something ... and it would remain pretty useful!
    [-]
    - karn97 393 days ago
      I wont trust an agent with anything by itself at their current state though.
- 2pointsomone 393 days ago
  I don't work in prompt engineering but my partner does and she tells me numerous need for agents in cases where you want some technology which goes and seeks things on the live web and then comes back and you want to make sense of that found data with the LLM and pre-written prompts where you use that data as variables, and then possibly go back into the web if the task remains unsolved.
  [-]
  - dimgl 393 days ago
    Can't that be solved with regular workflow tools and prompts? Is that what an agent is, essentially?
    Or is an agent a collection of prompts with a limited set of available tools?
    [-]
    - 2pointsomone 392 days ago
      I think the agent part is deciding how to navigate the web on its own and when it is convinced (and you haven't told it specifically deterministically) it found what it wanted, to come back and work with your prompts. You can't really logic code this into a workflow.
- ToJans 392 days ago
  AI seems to forget more things as the context window grows. Agents keep scope local and focused, so you can get better/faster results, or use models trained on specific tasks.
  Just like in real life, there's generalists and experts. Depending on your task you might prefer an expert over a generalist, think f.e. brain surgery versus "summarize this text".
- blainm 393 days ago
  One of the key limitations of even state-of-the-art LLMs is that their coherence and usefulness tend to degrade as the context window grows. When tackling complex workflows, such as customer support automation or code review pipelines - breaking the process into smaller, well-defined tasks allows the model to operate with more relevant and focused context at each step, improving reliability.
  Additionally, in self-hosted environments, using an agent-based approach can be more cost-effective. Simpler or less computationally intensive tasks can be offloaded to smaller models, which not only reduces costs but also improves response times.
  That being said, this approach is most effective when dealing with structured workflows that can be logically decomposed. In more open-ended tasks, such as "build me an app," the results can be inconsistent unless the task is well-scoped or has extensive precedent (e.g., generating a simple Pong clone). In such cases, additional oversight and iterative refinement are often necessary.
- jacobr1 393 days ago
  One way to think about it is job orchestration. You end up with some kind of DAG of work to execute. If all the work you are doing is based on context from the initiation of the workflow, then theoretically you could do everything in a single prompt. But more interesting is when there is some kind of real-world interaction, potentially multiple. Such as a websearch, or executing code, calling an API. Then you take action based on the result of then. Which in turn might trigger another decision to take some other action, iteratively, and potentially branching.
- nsonha 393 days ago
  Without checking out this particular framework, the word is sometimes overloaded with that meaning (LLM personality), but actually in software engineering in general, "agent" generally means something with its own inner loop and branching logic (agent as in autonomy). It's a neccessary abstraction when you compose multiple workflows together under the same LLM interface, things like which flow to run next, and edge case handling for each of them etc.
- andrewmutz 393 days ago
  Modularity. We could put all code in a single function, it is possible, but we prefer to organize it differently to make it easier to develop and reason about. Agents are similar
Gakho 393 days ago
Congrats on launching. I've noticed that switching prompts without edits between different LLM providers has degradation on performance. I'm wondering if you guys have noticed how developers do these "translations", I'm wondering since maybe your eval framework might have data for best practices.
[-]
- calcsam 393 days ago
  Yeah, this is something we've heard as well. No particular feature right now but we did ship an agent in local dev to help people improve their prompts.
  [-]
  - Gakho 393 days ago
    I'm wondering since there seem to be a lot of frameworks/websites that support evals, even OpenAI has evals.
    Do you think that a lot of these components like observability and evals will eventually be consumed by either providers (like OpenAI) or an orchestration framework like Mastra (when using multiple providers, though even if you're using just one provider for many tasks I can see it belonging to the orchestration framework)?
    [-]
    - calcsam 393 days ago
      I could be wrong but don't think OpenAI wants to be opinionated about that, except maybe the OpenAI solutions engineers :)
  - swyx 393 days ago
    link to this agent?
    [-]
    - calcsam 393 days ago
      demo: https://x.com/calcsam/status/1889856384549982419
epolanski 393 days ago
By the developers of Gatsby is a minus, not a plus makes me think this is going to be the next abandonware.
[-]
- paultannenbaum 393 days ago
  Surprised this is comment is not higher. Gatsby was one of the worst technologies I have worked with in my long career of working with various JS libraries and frameworks. Im sure the team is smart and capable, but I would not be advertising their work with Gatsby.
- christina97 393 days ago
  Same experience, I had the exact same thought. Was new to react and had to make a website… big mistake, wasted so many hours untangling the regex and hacks keeping together Gatsby over the next few years until that website was retired.
- squillion 393 days ago
  Gatsby never made sense to me. Weird design decisions I couldn’t find any plausible reason for. As soon as Next.js became capable of doing SSG I convinced my team to abandon Gatsby. Definitely a minus, sorry.
- user9999999999 393 days ago
  gatsby was one of the first static react frameworks, now you have things like nextjs remix astro etc... i dont think abandonware is fair, thats just the way software goes
  [-]
  - mplewis 393 days ago
    The Gatsby team made a lot of promises upon which they didn't follow through. Not a great way to build confidence in your next big project.
    [-]
    - DSchau 393 days ago
      … such as?
- benatkin 393 days ago
  The character Gatsby didn't function very well either (as far as being a successful person goes, I quite liked the book and he functioned well as a character) :)
  However, the Gatsby CMS had a couple of things that were really interesting about it - especially runtime type safety through GraphQL and doing headless WordPress.
  [-]
  - epolanski 392 days ago
    Interesting, because GQL was the most divisive thing of Gatsby.
_pdp_ 393 days ago
I don't want to be that person but there are hundreds of other similar frameworks doing more or less the same thing. Do you know why? Because writing a framework that orchestrates a number of tools with a model is the easy part. In fact, most of the time you don't even need a framework. All of these framework focus on the trivial and you can tell that simply by browsing the examples section.
This is like 5% of the work. The developer needs to fill the other 95% which involves a lot more things that are strictly outside of scope of the framework.
[-]
- calcsam 393 days ago
  Some people don't like frameworks. Some people do. We have a little bit of experience building frameworks, so we figured we'd build a good one.
  [-]
  - santa_boy 393 days ago
    I love frameworks :)
- incanspyder 392 days ago
  Couldn't agree more. This also looks mostly like a Typescript "port" of Langgraph, and I say "port" because Langgraph has a TS framework already.
- fsndz 393 days ago
  True. That's the reason I see a lot of people dropping similar frameworks like LangChain recently: https://medium.com/thoughts-on-machine-learning/drop-langcha...
  [-]
  - jerrygoyal 392 days ago
    i was using vercel ai sdk for my production app and it was such a bad experience that I eventually went with native implementation and tbh it was not much of work thanks to cursor. problems i faced: too many bugs (just browse their github repo to get an idea), the UI side also had suboptimal performance based on how they implemented hooks.
    [-]
    - ilrwbwrkhv 392 days ago
      vercel's whole shtick is to make money off of dumb js devs who do not know better. i think they pay far too much attention on how things look compared to how things work. but hey, they made millions, possibly billions off of those js devs so who is to blame them.
- cpursley 393 days ago
  I agree, and it feels like JS is just the wrong runtime for agents. Really languages that can model state in sane ways and have a good concurrency story like Elixir make much more sense.
  And here’s a fun exercise: ask Claude via Cursor or Perplexity with R1 to create a basic agentic framework for you in your language of choice on top of Instructor.
  [-]
  - mikehostetler 393 days ago
    > good concurrency story like Elixir make much more sense
    Agree, that's why I've been building this: https://github.com/agentjido/jido
    [-]
    - MattDaEskimo 392 days ago
      Call me an elixir virgin until 5 minutes ago. This language from a quick glance seems perfect for agent orchestration.
      Project looks great, will follow & learn.
      [-]
      - cpursley 392 days ago
        It's less about the language syntax and more about the capabilities of the underlying Erlang runtime. There's also Gleam on top of Erlang if you like stronger typing (gleam.run).
  - CharlieDigital 393 days ago
```
    > Really languages that can model state in sane ways and have a good concurrency story like Elixir make much more sense.
```
    Can you expand on this? Curious why JS state modelling falls short here and what's wrong with the concurrency model in JS for agents.
    [-]
    - dartos 393 days ago
      For one, NodeJS doesn’t have concurrency. It’s a single threaded event loop.
      [-]
      - CharlieDigital 393 days ago
        It has concurrency with Promise; it doesn't have parallelism.
        [-]
        cjonas 392 days ago
        And these agents are all network I/O bound by the model services so a lot of use cases don't need threading.
        I would argue that python is the overrated language when it comes to building agents. Just because it's the language of choice for training models doesn't mean it should be for building apps against them.
        The dx typescript brings to these types of applications is nice.
        [-]
        CharlieDigital 392 days ago
        > The dx typescript brings to these types of applications is nice.
        Ironically, it only gets halfways there.
        What I've found is that teams that want TS probably should just move up to C#; they are close enough [0]. The main thing is that once you start to get serious with your backend API, then data integrity matters. TS types disappear at runtime and it's just JS. So you need a Zod or Valibot to validate the incoming data. Then your API starts getting bigger and you want to generate OpenAPI for your frontend. Now your fast and easy Node/Express app is looking a lot like Spring or .NET...without the headway and perf...the irony.
        [0] https://github.com/CharlieDigital/js-ts-csharp
        holoduke 392 days ago
        No real concurrency. No scheduling. If you are not working with a lot of IO then js would be a poor choice. But in this case we talk about network calls, so definitely IO. The settimout, promise, request methods will do their job.
- fullstackwife 393 days ago
  You could describe all frontend JS frameworks the same way: you spend 95% of time on content and mechanics of your webapp, while the framework provides the easy 5%.
  [-]
  - chipgap98 393 days ago
    I think most JS frameworks save more than 5% of the effort for developers compared to writing raw JS. Especially when you include the ecosystem around those frameworks
harliem 393 days ago
Impressive. Have you seen any success with Mastra being used to build voice agents? Our company has been experimenting with VAPI, which just launched a workflow builder into open beta (https://docs.vapi.ai/workflows), but it has a lot of rough edges.
[-]
- calcsam 393 days ago
  We're just starting to do that and have a few TTS providers: ElevenLabs, OpenAI, PlayAI.
  We hear a lot from people who are outgrowing the voice agent platforms and moving to something like pipecat (in Python), and we'd love to be the JS option.
  [-]
  - jmkni 392 days ago
    Is any of the voice stuff in any way 'natural' sounding, I'd love to be able to recreate the ChatGPT app voice experience in my own app with a custom agent, but it just sounds robotic and crap
- soulofmischief 393 days ago
  If you'd like, feel free to reach out to me via email with your requirements and we can get a conversation going. I've built a few voice agent systems in both python and JavaScript and would love to hear about what issues you're running into. Might be able to build what you need.
eliotthehacker 393 days ago
I basically learned everything about how agents work by using Mastra's framework and going through their documentation. The founders are also super hands-on and love to help!
aranibatta 393 days ago
Congrats on launching! Curious how early the Mastra team thinks people should be thinking about evals and setting up a pipeline for them.
[-]
- calcsam 393 days ago
  We tend to recommend folks spend a few hours writing evals after they spend a couple weeks prototyping. Then they get a sense of how valuable evals are for their use-case.
  We think about evals a bit like perf monitoring -- it's good to have RUM but also good to have some synthetic stuff in your CI. So if you do find them valuable, useful to do both.
netcraft 393 days ago
This looks really great! How do you make money? Do you charge for deploying these to your platform? I couldnt find anything on pricing
[-]
- calcsam 393 days ago
  If you watch the demo video you will see the cloud platform we are building at the end. Right now it’s in beta.
monideas 393 days ago
Are there any plans to add automatic retries for individual steps (with configurable max attempts and backoff strategy)?
davedx 393 days ago
Why is it on top of Vercel’s platform?
[-]
- netcraft 393 days ago
  It looks like theyre using the vercel ai sdk, which really isnt the vercel platform, doesnt have anything to do with any of the rest of vercel. Its actually quite nice and full featured.
- calcsam 393 days ago
  It’s not. It’s on top of AI SDK, which is a popular open source library maintained by Vercel.
  [-]
  - tomhallett 393 days ago
    So the vercel ai js-sdk is not tied to the vercel platform and is “just” a js library which points to various llms? Is there any promise/plan/etc to keep it that way?
    [-]
    - leerob 393 days ago
      (I work at Vercel) Yes, it will continue to be an MIT-licensed open-source library to simplify building AI apps.
PetrBrzyBrzek 392 days ago
I created a similar library for orchestrations, but it’s more explicit and lightweight. https://github.com/langtail/ai-orchestra
lmrl 392 days ago
Congrats, looks promising! 1. Is it possible to create custom endpoints? I see that several endpoints are created when running “mastra dev”.
2. Related to previous question, since this is node based, is it possible to support websockets?
[-]
- calcsam 392 days ago
  1. what are you wanting to create?
  currently agents and workflow endpoints are created at `/api/workflows/workflow-id` and `/api/agents/agent-id/` for workers and agents
  2. we are thinking about it -- curious what you'd use it for?
orliesaurus 392 days ago
Does Mastra support libraries of tools for agents like toolhouse.ai or https://github.com/transitive-bullshit/agentic
[-]
- calcsam 392 days ago
  Agentic's tool library _should_ also work for Mastra via its AI SDK adapter.
  (We haven't tested this, so if you do try let us know if you see quirks!)
  [-]
  - orliesaurus 392 days ago
    What about Toolhouse and/or composeio?
    [-]
    - tough 390 days ago
      in npm there's a mastra/composeio package that might work, they also seem to have some mcp support
dstroot 392 days ago
Congrats! Side question - is the website OS as well? I'd like to "borrow" the Nav Bar code. I looked on GitHub and couldn't find it in the repos and 300+ branches. Cheers!
dhorthy 393 days ago
i am very long on TS as the future of agent applications. nice work team
[-]
- calcsam 393 days ago
  thanks!!
_1 393 days ago
This looks really nice. We've been considering developing something very similar in-house. Are you guys looking at supporting MLC Web LLM, or someother local models?
[-]
- calcsam 393 days ago
  Yup! We rely on the AI SDK for model routing, and they have an Ollama provider, which will handle pretty much any local model.
  [-]
  - tough 390 days ago
    can confirm this works well with any OpenAI like API endpoint, like ollama or LM studio's
realmikebernico 393 days ago
Congrats! This is exactly what the AI world needs. I'm thinking about using Mastra for a class I'm working on with AI Agents.
[-]
- ash_091 393 days ago
  So an AI Mastra Class?
- calcsam 393 days ago
  that's awesome!
5Qn8mNbc2FNCiVV 393 days ago
I thought Kyle Matthews was the creator of Gatsby
[-]
- calcsam 393 days ago
  Kyle started the project, I started helping pretty shortly thereafter, then he and I cofounded the company together. Kyle's working on ElectricSQL now but is using us, we're doing a meetup together next month, etc.
  [-]
  - thruflo 393 days ago
    Come along :)
    https://lu.ma/sync-sf
- dang 393 days ago
  I put the "creators" bit in the title because I thought readers would find it interesting. Sorry if that was not-quite-right! I've turned them into developers now.
fnikacevic 393 days ago
Do the workflows support voice-to-voice models like openai's realtime? Or if something like that exists I'd be curios.
cshimmin 392 days ago
Interested to learn more about the PDF -> CAD project built on mastra, can you share a link?
[-]
- calcsam 392 days ago
  https://www.artifact.engineer/
tobyhinloopen 392 days ago
Neat, I’m going to use this
[-]
- calcsam 392 days ago
  awesome! let us know how it goes
dikaio 392 days ago
Got excited, was hoping to see a repository of Go Agents.
levensti 393 days ago
Super excited to try out the new agent memory features
[-]
- swyx 393 days ago
  interesting to contrast the recent memory releases
  - https://mastra.ai/docs/agents/01-agent-memory
  - https://blog.langchain.dev/langmem-sdk-launch/
  - https://help.getzep.com/concepts#adding-memory
  not sure where all this is leading yet but glad people are exploring.
  [-]
  - calcsam 393 days ago
    100% and agree with this, we saw the langmem stuff last night
    imho getting some sort of hierarchical memory is conceptually fairly straightforward, the tricky part is having the storage and vector db pieces well integrated so that the apis are clean
- calcsam 393 days ago
  let us know what you think!
asati 393 days ago
Congrats guys! really excited to try this out!
albertmz 391 days ago
Kudos on using XState!
gregpr07 393 days ago
Any timeline for python?
[-]
- thinkxl 392 days ago
  You probably already know, but in case you don't, python has phidata[1]
  [1]: https://docs.phidata.com/introduction
- calcsam 393 days ago
  Not planning on it — we think frameworks should be single-language
fuddle 393 days ago
"You may not provide the software to third parties as a hosted or managed service" - The Elastic v2 license isn't actually open source like your title mentions: "Open-source JS agent framework"
https://github.com/mastra-ai/mastra/blob/main/LICENSE
[-]
- calcsam 393 days ago
  I mentioned that in the comment. We’re using Elastic v2 for now because we want users to be able to do anything with us, but protect from eg AWS
  [-]
  - fuddle 393 days ago
    If the license isn't open source, then the SDK shouldn't be labeled as open source.
  - Tomte 393 days ago
    So it‘s a lie.
delduca 393 days ago
> Mastra uses the Vercel AI SDK
It started off wrong.
[-]
- jcheng 393 days ago
  Care to elaborate? I’ve never used it but have heard good things from colleagues who have.
  [-]
  - delduca 393 days ago
    Lock in.
    [-]
    - SparkyMcUnicorn 392 days ago
      What lock in?
      I use their AI SDK, but never touch vercel servers. It's just a unified interface.
      [-]
      - delduca 392 days ago
        The SDK is the lock in.
        [-]
        SparkyMcUnicorn 391 days ago
        Same as any other open source framework or library.
        Calling that "lock in" is a stretch, but you're free to write everything from scratch if that's the way you roll.
jobryan 393 days ago
Bamfs
[-]
- calcsam 393 days ago
  lol thanks
pablodecm 393 days ago
Very interesting set of abstractions that address lots of the pain points when building agents, also the team is super eager to help out!
[-]
- calcsam 393 days ago
  thank you!
animanoir 393 days ago
[dead]
yovboy 393 days ago
You’re awesome guys! I had so many problems with lanchain and am very happy since switching to Mastra
[-]
- ge96 393 days ago
  that sus account with no activity until now
- calcsam 393 days ago
  that's great to hear!!
bobremeika 393 days ago
A TypeScript first AI framework is something that has been missing. How do you work with AI SDK?
[-]
- swyx 393 days ago
  idk man
  https://js.langchain.com/docs/introduction/
  https://www.vellum.ai/products/workflows-sdk
  https://github.com/transitive-bullshit/agentic
  which is not to say any of them got it right or wrong, but it is by no means "missing". the big question w all of them is do they deliver enough value to last. kudos to those who at least try, of course
- calcsam 393 days ago
  We originally were wrapping AI SDK, but that confused people who wanted to use both, so we decided to make the API more explicit, eg:
  import { Agent } from "@mastra/core/agent"; import { openai } from "@ai-sdk/openai";
  export const myAgent = new Agent({ name: "My Agent", instructions: "You are a helpful assistant.", model: openai("gpt-4o-mini"), });
- soulofmischief 393 days ago
  Mine is written in TypeScript and I still think it's more ergonomic than anything else I'm seeing in the wild. Maybe there's finally an appetite for this stuff and I should release it. The Mastra dashboard looks pretty nice, might take some notes from it.
- campers 392 days ago
  https://typedai.dev is another full-featured one I've built, with a web UI, multi-user support, code editing agents, CodeAct autonomous agent