Latest Posts (10 found)
Matt Mazur 9 months ago

2024 Year in Review

So what’s the end goal here? I started Emergent Mind right after ChatGPT launched with the simple goal of building a great product in the AI space. The first two iterations (the ChatGPT examples site and AI news aggregator) were okay, but ultimately not interesting enough to continue with. Some good came out of them though, because they led me into the research space, which I’m now fascinated with and think I can have an impact in by building AI tools for scientists, engineers, and researchers to do better research. I’m in a weird spot too where I have this $10k+ MRR passive income business with Preceden that makes enough to support me, and while getting Emergent Mind there too would be great, it seems kind of like a hollow goal, and a missed opportunity to set a different, more ambitious type of goal. The way I’ve started thinking about it is that my mission with Emergent Mind is to build a product that contributes to someone making a scientific discovery that has a significant positive impact on the world. It sounds kind of delusional, I know, but I think the platform is actually very well positioned to do that in the future. That mission also provides clarity on questions that might have different answers if the goal was to build a big business to sell to Google. For example: We’ll see how this all goes. It might not play out like this, but it seems very much worth trying, and I’m enjoying the journey. Also, if you have any suggestions on Preceden or Emergent Mind, please drop me a note, I’d very much value that feedback. Until next time

0 views
Matt Mazur 1 years ago

It’s Time to Build

It’s been a few months so I wanted to say hey to the 7 of you who follow this blog and share a few updates about what I’ve been up to. At the start of 2023 I quit consulting to go full time on Preceden , my SaaS timeline maker, after growing it on the side for about 13 years . Around the same time I started working on LearnGPT (which would eventually become Emergent Mind ), and wound up spending about 70% of 2023 working on Preceden building out various AI capabilities like its visual timeline generator and 30% working on LearnGPT/Emergent Mind. In November I pivoted Emergent Mind from an AI news aggregator to an AI research aggregator, and I’ve been working on it full time since then. I’ve barely worked on Preceden since November. I answer about a dozen support emails each week and fix the occasional bug, but haven’t worked on any major product updates in a while. A good chunk of those support emails are refund requests, which I actually think is a good sign, because the lack of bug reports and feature requests reflect that the product is in pretty good shape. Preceden revenue is up about 5% year to date, the lowest it’s ever been. It’s tempting to see that and conclude that it’s because I haven’t worked on it in 5 months, but the reality is that churn finally caught up to new MRR growth, and it’s largely because of a subtle mistake I made in the fall. Preceden has always struggled to rank well for key search terms like “timeline maker”, despite it having pretty good SEO positioning. I realized around October that the reason for this might be because over its lifetime lots of users have created near-identical public timelines on historical topics, like hundreds of timelines on the Russian Revolution. Maybe Google was penalizing the site for this duplicate content. To remedy this, I used the AI timeline generator I built to generate around 200 timelines on common historical topics , and then 301 redirected about 20k public user-generated timelines to the AI-generated ones in an effort to reduce the amount of content on Google that it was possibly interpreting as spammy. Good thought, but one problem: I accidentally no-index all of those AI-generated timelines, and because I was heads down on Emergent Mind and not paying close enough attention to Preceden’s metrics, I didn’t realize it for about 4 months. Those 20k public timelines drove a lot of traffic and sign ups, and when I redirected them all to no-indexed pages, I lost all that traffic, and a good portion of Preceden’s new MRR disappeared as well. I got the AI-generated timelines re-indexed, but traffic hasn’t fully recovered, which is why revenue is up 5% and not higher like it’s been in the past . The good news though is that despite this mistake, Preceden continues to bring in income equivalent to a decently-paid developer’s salary, and it’s entirely passive, allowing me to pursue other things. I’m taking advantage of that and chilling on the beach reading all day. Except not at all. Emergent Mind helps people discover and learn about new AI/ML research. It gets 10k-15k visitors per month currently and people seem to get a lot of value out of it. The increasing frequency of emails like this from Emergent Mind users is a great sign . (Pretty sure this person is not a native English speaker and he used ChatGPT to draft this, but it still makes my day.) pic.twitter.com/Qlbe3Y4Shb And last week I rolled out some very early paid plans and it now has non-zero revenue coming in: With this first payment, Emergent Mind officially has MRR, woot woot pic.twitter.com/BeIl2LhaTA It’s not much, but it’s a start. Me: “$12 MRR hun…” Her: “So that’s like one millionth of a cent per hour you worked on it?” pic.twitter.com/gSisiYRPN3 The thing is though, I’m not optimizing for revenue right now. I think of Emergent Mind as a product lab operating at the intersection of LLMs, research, and education. The way I see it, we’re at a point right now similar to the mid-90s when internet usage exploded with the advent of AOL. Similar to how many companies from that time period focused on building out better infrastructure to enable broader and faster internet usage, there are lots of companies right now focused on building bigger, more powerful LLMs. And similar to 1995, I think we’re going to see a ton of innovation in the coming years in the type of products and businesses being built with this new technology. That’s what I want to focus on. I want to build tools in the research space at the frontiers of what’s possible with generative AI. I think we’ve seen like 2% of what’s going to be built with these technologies, and I want to spend most of my time exploring that other 98%. These will range from quick features that take several hours to launch, to some in the future that will take months to build. Some of these will be silly and most won’t go anywhere, but I think there’s a huge opportunity right now to tinker with an entrepreneurial mindset and create new types of innovative and hopefully useful products. Like, what if you put an agent in charge of your Twitter account and set it up to automatically optimize itself based on engagement? What if you built a deeply integrated chatbot into your site that tried to persuade visitors to sign up for your newsletter based on their usage of the site? If you have access to the latest scientific research, could you use LLMs to identify gaps in our knowledge? Could you use LLMs to fill in those gaps? Could you build an AI-enabled educational tool that helps a software developer gain fluency in the type of advanced math you might find in a diffusion paper? I don’t have the expertise to be confident about what’s going to work and what’s not (does anyone?), so I’m going to just experiment and learn and iterate and see where it goes. With Preceden’s passive income, I can pursue this for a while, not forever. I do have a small team of amazing contractors helping out ( Milan on design and Omar on AI engineering); it will be important to monetize Emergent Mind so I can support this team and possibly add more folks in the future. Ideally, Emergent Mind will make enough income at some point soon-ish where I can continue doing this long term without relying on Preceden’s income to support it. Honestly there’s nothing else I’d rather be doing right now. For me, building a software business has always been about freeing up my time so I can spend more time learning and building. It took a while, but I’m kind of at that point right now where I can do that all day without being laser-focused on revenue growth. I have no idea how this approach will play out, but I’m excited to see what happens. Thanks for following along .

0 views
Matt Mazur 1 years ago

My Indie SaaS Revenue has Grown 37% per Year for 13 Years

For any indie founders out there who have not seen hockey stick growth for their product, I hope this serves as some evidence that it is possible (and perfectly fine!) to slowly grow your side project over many years. If you can maintain slow but consistent revenue growth year after year, it should eventually grow into a meaningful amount of revenue and give you options down the road, whether it be to go full time on it, or use it to support yourself while pursuing other projects (like I am now with Emergent Mind , a resource for staying informed about important new AI/ML research), or something else entirely. And even if you never go full time on it, the lessons you’ll learn trying to grow your business will make you a much more valuable employee and help you grow your salary, which is a great outcome as well. Drop me a note if you’re on a similar journey, I’d love to say hey: [email protected].

0 views
Matt Mazur 1 years ago

Is the ChatGPT API Refusing to Summarize Academic Papers? Not so fast.

It will grab the summarization prompt in prompt.txt , run it through the endpoint 10 times (or however many you choose), and output the responses to . Each request costs about a cent, so you don’t have to be too concerned about any experiments consuming your quota. If you run this script as-is, you’ll likely see about half of the requests result in refusals such as: It’s easy to see this and come to the conclusion that ChatGPT can no longer be reliably used for summarization tasks. But, reality is more complicated. Here’s the prompt Emergent Mind and this script are currently using, which I’ve iterated on over time to deal with various issues that popped up in the summaries: You will be given the content of a newly published arXiv paper and asked to write a summary of it. Here are some things to keep in mind: Here’s the paper: Now, take a deep breath and write a blog post about this paper. If we change the prompt though to simply ‘Please summarize the following paper,’ it seems to work 100% of the time. The problem doesn’t seem to have to do with summarizing papers, but about the guidance I provided about how to summarize the paper combined with the content of some papers. I spent a while this morning testing different combinations of those bullet points to figure out what’s causing the refusal, but couldn’t figure it out exactly. My impression is that it has something to do with the complexity of the guidance or because it thinks I’m attempting to do something shady with copyrighted work (note that earlier on the page it lists all of the paper’s authors, which is why I I’m excluding them from the summary). A few other things to note: So, in short, the ChatGPT 3.5 API occasionally refuses to generate complex summaries of some papers. This may be new behavior, or may not be. If anyone ends up experimenting with the script and learning anything new, or if you have any insights as to the behavior I’m seeing here, please drop me an email or leave a comment below, and I’ll update this post accordingly.

0 views
Matt Mazur 1 years ago

Reflecting on My First Year as a Full Time Indie Founder

Where to begin? :) Shortly after ChatGPT launched in late 2022, I launched LearnGPT , a site for sharing ChatGPT examples. The site gained some traction and was even featured in a GPT tutorial on YouTube by Andrej Karpathy . But, a hundred competitors quickly popped up, and my interest in continuing to build a ChatGPT examples site waned, so I decided to shut it down . But then I got some interest from people to buy it, so I put it up for sale , got a $7k offer, but turned it down , and then rebranded the site to Emergent Mind and switched the focus to AI news. A few months into that iteration, I lost interest again (AI news competition is also fierce, and I didn’t think Emergent Mind was competitive, despite some people really liking it), so tried selling it again. I didn’t get any high enough offers, so decided to shut it down, but then decided to keep it , even though I didn’t know what I’d do with it. And guess what: in November I had an idea for another iteration of the site, this time pivoting away from AI news and into a resource for staying informed about AI/ML research. I worked on that for a good chunk of November/December, and am currently mostly focused on it . I’m cautiously optimistic about this direction though: the handful of people that I’ve shared it with have been very enthusiastic about it and provided lots of great feedback that I’ve been working through. Unlike my previous product launches, I’m saving a HN/Reddit/X launch announcement for later, after I’ve gotten the product in really good shape. There’s still lots of issues and areas for improvement, and I believe now it’s a better route to soft launch and iterate on it quietly based on 1:1 feedback before drawing too much attention to an unpolished product. Hat-tip Hiten Shah for influencing how I think about MVPs . I’ll add too that this “surfacing trending AI/ML research” direction is the first step in a larger vision I have for the site. I think it could evolve into something really neat – maybe even a business – though time will tell. Preceden is in a good/interesting spot where it’s a fairly feature-complete product that requires very little support and maintenance. I don’t have any employees, and could not work on it for months and it would likely still grow and continue to work fine. When I look ahead, the most popular feature requests seem like they won’t be heavily used and will wind up bloating the product and codebase. That doesn’t mean there’s no room for improvement – there always is – just that I’m not sure it makes sense anymore for me to be so heads down in VS Code working on it. It’s the first time maybe ever that I’ve thought that. I’d probably see more business impact by spending my time on marketing, but that’s not exactly what I want to spend a lot of my time doing, plus I also can’t afford the kind of talent I’d need to market it effectively either (marketing a B2C horizontal SaaS isn’t fun). So, my current thinking is that I’ll keep improving and lightly marketing Preceden, but with less intensity than I have in years past. Instead, I’ll devote more of my time to building other products: Emergent Mind and maybe others in the future. Maybe one of those will turn into a second income stream but maybe not. I enjoy the 0 to 1 aspect of creating new products, and the income from Preceden supports me in pursuing that for now. And if Preceden starts declining, I can always start focusing on it again, or go back to contracting or a full time position somewhere, which isn’t a bad outcome either. Also, one thing I regret not doing more of in 2023 was spending more time wandering. It’s easy for me to get super focused on some project and not leave any time in my day for exploring what else is out there. Only toward the end of the year did I start experimenting with new AI tech like Mixtral . Going forward, I want to spend some time each week learning about, experimenting with, and blogging about new AI tech. I’m still very much in the “AI will change the world in the coming years” camp, and I have the freedom and interest to spend some of my time learning and tinkering, so am going to try to do that. As always, I welcome any feedback on how I’m thinking about things. Happy new year everyone and thanks for reading .

0 views
Matt Mazur 1 years ago

Running Mistral 7B Instruct on a Macbook

Unless you have a very powerful Macbook, definitely experiment with this model instead of the MOE model .

0 views
Matt Mazur 1 years ago

Emergent Mind in The Atlantic

You can read Full Fact’s article about this glass-eating snippet here: Google snippets falsely claimed eating glass has health benefits . As I noted on X , I quickly removed this page from Emergent Mind on the off-chance that someone misinterprets it as health advice. Something tells me this won’t be the last we’ll hear about Google misinterpreting ChatGPT examples on Emergent Mind. Until then…

0 views
Matt Mazur 1 years ago

Exploring ChatGPT’s Knowledge Cutoff

This highlights the need to not only consider the possibility of hallucinations in ChatGPT’s responses, but in the possibility of ignorance as well, especially for information about recent events. This will be less of an issue if you’re using GPT-4 with browsing enabled, but will still impact older models as well as usage of GPT-4 via the API or playground. Again, if anyone spots anything I’ve overlooked in this analysis, please drop me a note. Thanks for reading .

0 views
Matt Mazur 1 years ago

Experimenting with GPT-4 Turbo’s JSON mode

Hopefully this gives you an idea of what’s possible with the new JSON mode. Shout out to the OpenAI team for implementing it. And if you have any other tips about using this new feature, please drop a comment below and I’ll update this post accordingly. Cheers!

0 views