Documenting Our AI Journey

Welcome to Hal Speaks, my blog on our journey from a simple IVR agent to a stateful AI assistant, including all the wins, frustrations, and opinions in between.

Ed’s Thinking…

What the %$#!, Railway?!

Ed

2 months ago

Well, today turned into chaos. Around 6:20pm ET May 19, Railway went completely down. Not only my deployments, but everyone else’s too. Update: Here’s Railway’s post-mortem. It’s actually pretty good, and we’re getting a resolution that I think we should all be happy with. Our entire AI platform (plus this blog – moving that ASAP)…
Read more
Did Anthropic Box OpenAI In For the Foreseeable Future?

Ed

2 months ago

I know: quite a long time since I last posted. Been spending a lot of time getting all the agents’ bugs worked out once and for all, and yes, I’m quite close. But that’s not why I’m here. Caught this post on BlueSky earlier today. While I don’t think Eris is wrong in saying this…
Read more
This. Is. Huge.

Ed

3 months ago

Google’s TurboQuant answers AI’s biggest problem: resource usage.
Read more
Week 1 Check-In: Going Better than I Thought

Ed

3 months ago

Here’s the first of my weekly check-ins, where I round up the work that I didn’t mention in a particular blog post during the week. It will also serve as a way for me and you to monitor progress.
Read more
The Anonymous Tester

Ed

3 months ago

I may have stumbled upon a potentially useful way to prevent a stateful agent’s memory from being “polluted” by likely incorrect or garbled data during testing.
Read more
Why I Chose Claude Haiku

Ed

4 months ago

Claude Haiku often feels like the lovechild of the Anthropic model family: afraid, ashamed, misunderstood, to quote the timeless Diana Ross. But it shouldn’t be that way.
Read more
Holy Sh*t, Is Mythos the Real Deal

Ed

4 months ago

The Claude Mythos system card is a read, and it feels like a massive shift in how Claude works is about to happen. Why? Sycophancy is dead.
Read more
Anthropic should learn from OpenAI

Ed

4 months ago

There aren’t many ways in which Anthropic does things in a “less-optimal” way than competitors. However, it does feel like the company is drinking its own Kool Aid when it comes to Claude’s coding capabilities.
Read more
He’s alive…

Ed

4 months ago

…and hallucinating tool calls. Well, it was this morning. More of an issue with the system prompt being a little too permissive. But glad to finally have this working!
Read more
Claude is becoming unusable for no reason

Ed

4 months ago

I am getting annoyed. I have no idea what is wrong with Claude right now, but there is a serious issue with token usage and limits. Thought people were nuts, but after three questions, I burned through my entire limit. On Sonnet.
Read more