
Stack Overflow CEO Prashanth Chandrasekar.
The question-and-answer website Stack Overflow was based 17 years in the past to permit programmers — human programmers — to publish questions on programming issues and get solutions from a neighborhood of like-minded people.
Since its founding, the world has develop into enamored with how ChatGPT and different generative AI applications can’t solely provide solutions however even carry out the work itself, handing you your individual customized code.
How ought to a neighborhood of individuals sharing data reply to the sudden enchantment of AI automation?
Stack Overflow's CEO, Prashanth Chandrasekar, has been operating the corporate for nearly six years — and he has a plan. I sat down with him this month to find out how that plan is coming collectively.
Additionally: What is Stack Overflow? A forum for all who code
"Issues have modified; we need to change with the occasions," Chandrasekar informed me. "We wished to go together with the path of the movement of the river."
The rise of ChatGPT
He didn't actually have a alternative. Beginning in 2022, the rise of ChatGPT had a right away affect on Stack Overflow's public website site visitors, which delivers the promoting that accounts for a big portion of its income.
A main manner during which folks got here to Stack Overflow was at all times a Google search. Paid search on Google was the "person interface to Stack Overflow," as Chandrasekar places it.
With the ability to ask a chatbot as a substitute of looking out shortly reduce into these Google queries, and site visitors started to say no.
Chandrasekar arrived at a broad philosophical conclusion about not simply AI and automation but additionally the web.
"Our view is that the character of the web has modified," he mentioned. It's not largely about paid search from human queries driving website site visitors. "The person interface has modified to be Gen AI instruments," he noticed.
"And, so, we're having to form of be as aware of that trade change as attainable. We have to additionally diversify" as a property.
Additionally: Stack Overflow joins Reddit and Twitter in charging AI companies for training data
Chandrasekar and staff realized there may be quite a lot of worth in Stack Overflow's 60 million solutions to handle the shortcomings of generative AI. One choice was to sue OpenAI and different makers of AI basis fashions. The "pre-trained" giant language fashions embody Stack Overflow conversations, that are legally the property of Stack Overflow.
"We mentioned, OK, we might try this, or we might do one thing a little bit bit extra, let's say, novel, relative to what everyone else was doing."
As an alternative of submitting go well with, Stack Overflow put in code to dam scraping of the general public web site, "after which we mentioned, 'in case you're constructing a chat bot or no matter, it’s a must to work with us. Let's have a good enterprise settlement.'"
Information as a service
On high of promoting and the paid enterprise model of Stack Overflow, referred to as Groups, Chandrasekar is constructing out a 3rd income often called data options — or, knowledge-as-a-service.
Chandrasekar has been signing offers to license Stack Overflow's content material, first with Google and OpenAI.
Up to now 18 months, he's additionally been busy signing "all of the cloud hyperscalers, I can't title them, however you already know them," he mentioned. "We’re within the technique of doing many extra."
These offers embody integration with Microsoft's Visible Studio Code device, GitHub CoPilot, and Google Gemini Code Help — to get solutions proper contained in the programming setting.
"All these corporations are taking, are leveraging an official licensed model of our dataset to coach their fashions for pre-training to do issues like RAG and indexing in some circumstances."
Additionally: AI bots scraping your knowledge? This free device offers these pesky crawlers the run-around
Solutions get surfaced on ChatGPT or different LLMs, with a credit score and a hyperlink again to Stack Overflow. "The aim is to really recapture the site visitors that folks gave to our web site instantly," he mentioned. "We have gotten extra of a headless web site: if individuals are spending all their time in gen AI bots, that's additionally high-quality."
After all, there's the danger of not having direct relationships with the tip person if they aren’t truly on the Stack Overflow website. Chandrasekar mentioned the corporate has numerous agreements to get pertinent data from OpenAI and others about issues such because the immediate that the chatbot person is utilizing.
"There are quite a lot of subtleties within the engagement between the associate and us," he informed me. "We’re working via every state of affairs," he mentioned, similar to how a lot of an LLM's "context window" (the latest reminiscence of chats) is shared with Stack Overflow.
AI's shortcomings
Chandrasekar mentioned that Stack Overflow is fixing three main shortcomings of the expertise for OpenAI and the opposite giants.
Additionally: Why scaling agentic AI is a marathon, not a dash
One shortcoming is what he calls the belief level. "You don't belief what's popping out of it," alluding to the notorious LLM hallucinations and confabulations.
The second shortcoming is LLM mind drain. "Should you don't generate new data, these LLMs will not be going to progress of their intelligence stage," as is evident from the controversies over so-called artificial knowledge that may pollute LLMs.
Final, and maybe most essential, "the solutions popping out of the gen AI are literally not data," mentioned Chandrasekar. "There could also be a solution, however they might be tapped out on complexity as a result of that is too difficult of a set of circumstances," and due to this fact, actually must have the wealthy context of Stack Overflow, he mentioned.
Groups integration
On the similar time that he has inked offers with the giants for the general public Stack Overflow, Chandrasekar has begun a second a part of the data options enterprise. Stack Overflow is integrating its public content material into the Groups product for firms that need to increase their inside data assets for the aim of agentic AI.
The Groups product was launched as a result of corporations mentioned they wished their very own model of Stack Overflow as a repository not for common programming data however for his or her explicit company processes. Now, mentioned Chandrasekar, the identical corporations need to increase that data pool as a result of they need to develop AI brokers that do much more than programming.
"We now have corporations like Uber who’ve truly finished this with us already, the place they've constructed an AI chatbot that's referred to as an assistant, or, in a really beneficiant sense, agent as a result of it's all about performing the motion. However that agent is leveraging the Groups knowledge for one thing like, 'How will we truly do that' inside Uber? The bot serves up an computerized reply inside a company chat. "The bot has all of the data from the staff, so it's surfacing the suitable data on the proper time."
Additionally: Stack Overflow CTO: From bootstrapped to scaling one of the Web's biggest properties
In different phrases, agentic AI automates what staff historically do with Groups. "We seen this as a result of our APIs are red-hot; our APIs are being hit continually by the bot."
On condition that development, Chandrasekar is including a brand new factor to Groups, licensing your entire Stack Overflow public website knowledge to the identical Groups prospects.
"We mentioned, 'Why don't we take our data options product, our public platform knowledge, and in addition current that to corporations alongside Groups knowledge?' An agent then can have the data from 60 million questions and solutions, and in addition all of the data particular to the corporate, after which it's much more armed with the suitable reply on the proper time."
The Groups product, like the massive licensing offers, is built-in into numerous merchandise, similar to Atlassian's JIRA IT ticketing system.
Stack Overflow is negotiating licensing phrases with enterprise Groups customers. Chandrasekar declined to debate pricing particulars aside from to say, "It's a value-based pricing mannequin."
Additionally: 10 skilled builders on vibe coding's true promise and peril
"It's early phases," he mentioned, in figuring out what the market will enable for such content material licensing.
Web site enhancements
Similtaneously data options are being developed, the general public Stack Overflow website is receiving some pretty vital enhancements.
The corporate continues to be "working our manner again" to the extent of site visitors previous to ChatGPT, mentioned Chandrasekar, with out disclosing site visitors numbers, including the location "has not but absolutely" made it again to the pre-ChatGPT stage.
Chandrasekar is constructing out the performance of the general public website to make it extra real-time.
The normal mode of Stack Overflow is one particular person posting a query after which others posting their prompt solutions.
Two different types of alternate are buried deep throughout the website — chats and discussions. Chandrasekar describes these as "swim lane" modes of interplay that don't present the perfection of the principle stack — overflow solutions — however can get a response to somebody far more shortly.
Chandrasekar's philosophy is that "we need to present a number of kind components and venture varieties for technologists of various varieties."
The corporate can also be fascinated by including tutorial video content material from customers. "There's quite a lot of nice content material," he noticed. "Think about if someone is testing DeepSeek and we’re capable of capable of, let's say, dwell stream that and seize it and someone else can study from that have. We actually need to go from being a data base into far more of a neighborhood website."
If this sounds to you want Reddit, Chandrasekar mentioned he will get that loads. The distinction, he famous, is that "we’re clearly a really specialised viewers; we’re not attempting to be all issues to all folks." He admires the bigger social website. "I do know them very properly, and so they've been nice. They’re very a lot a detailed cousin to us, or, possibly, an even bigger brother."
He noticed that Reddit's licensing offers with OpenAI helped pave the best way for the data options enterprise.
There may be at all times the hazard in increasing a profitable property that one can unfold one's efforts too skinny. How does Stack Overflow place its bets?
"We need to decide those that resonate probably the most with our customers," he mentioned. "I discuss to the neighborhood loads, principally have interaction with them to grasp which of them to double down on. We are actually going and operating exams on which of them truly make a distinction."
Additionally: Chatbots are distorting information – even for paid customers
The connection internally with gen AI has additionally modified. Early on, when ChatGPT got here public, some customers of Stack Overflow had been grabbing ready-made solutions from the bot. The positioning responded by banning the cut-and-paste replies.
"However then, we mentioned, let's discuss to the neighborhood and see how they wished to do issues," he mentioned. "One factor that turned apparent is that folks nonetheless discovered it to be pretty tough to simply have interaction human to people on asking questions."
Historically, human moderation on the location may result in moderators scolding repetitive or beginner questions. "Should you're asking a query a few technical topic, if somebody had answered earlier than, someone would inform you, that's a improper query. Go search earlier than you ask."
There got here "an enormous alternative to make use of AI."
The positioning lately went dwell with "Gemini-powered" solutions. Now, "Gemini is supplying you with a prompting it's all been requested and answered " and taking you to the related itemizing, "all in a non-public window of simply you and the AI," so there's no disgrace in your beginner inquiry.
The modifications to Stack Overflow's public website are the newest initiative, however the licensing offers and the additions to Groups appear to be serving to the enterprise already.
Going ahead
"We're rising as an organization," mentioned Chandrasekar, whereas declining to reveal financials. Stack Overflow is owned by European funding big Prosus NV of The Netherlands, which acquired it 4 years in the past for $1.8 billion.
Prosus is publicly listed, so the corporate could disclose precise monetary details about Stack Overflow when it broadcasts its full fiscal yr report, which it often does towards the tip of June yearly.
The data options a part of the enterprise has develop into Stack Overflow's fastest-growing enterprise, adopted by the Groups gross sales, and the promoting enterprise, which is a "very regular" enterprise due to the fixed demand to promote to programmers the place they spend time. Every of the three is a 3rd of income, roughly, mentioned Chandrasekar.
Maybe the stickiest a part of that is how the neighborhood handles it. There was pushback when the corporate first approached Google and OpenAI about knowledge licensing — not stunning, because the neighborhood considers the 60 million questions their neighborhood property in a way, even whether it is legally the property of the Stack Overflow company.
There was a lot pushback that some customers mentioned they had been banned from the location in the event that they prompted a stir over the licensing offers.
With out entering into the main points of previous conflicts, mentioned Chandrasekar, at this level, Stack Overflow customers have come to appreciate that the corporate is just not a not-for-profit and that it must make an revenue to serve its function.
Additionally: Stack Overflow could suspend your account if you change a post to protest OpenAI's deal
"I did an AMA two weeks in the past," an "Ask Me Something" with customers on Stack Overflow, he recalled. "I mentioned, look, we're not like another website on the market, however we don't take donations. You have got a enterprise that helps the location, and one option to drive a enterprise is to leverage what you have got that's helpful so as to add worth within the ecosystem."
The outcome, he mentioned, is that "they realized it is a good factor for Stack, and for them, as a result of by not doing this, we are literally not capturing the income that we’d like to have the ability to make investments again into the neighborhood, to construct the moderator instruments that they want. So, the neighborhood understands it now, slowly."
Need extra tales about AI? Sign up for Innovation, our weekly e-newsletter.