With rising moderator controversies, decline in user numbers, Stack Overflow announced the integration of generative AI on their platform with Overflow AI – something the company has been hinting about since April. Through this, Stack Overflow aims to improve quality and relevance of resources to user queries. The feature will not only be available on the public platform but also extend to Stack Overflow for Teams. Will this be a promising step for Stack Overflow towards getting back to being relevant again?
OverflowAI helps you create content within your Stack Overflow for Teams community, by taking existing sources within your instance and creating a tagging framework and series of questions and answers. pic.twitter.com/TDIcIu7DNU
— Stack Overflow (@StackOverflow) July 27, 2023
Arms Against ChatGPT
Ever since the ChatGPT rage began last December, Q and A community platform for programmers Stack Overflow, has been in news, either for their new policies and regulations or simply for their user switch. Developers who have been using Stack Overflow for copying code started switching to ChatGPT. With the launch of Code Interpreter plugin on ChatGPT, it has only made it even more coveted as a coder platform.
When ChatGPT was finding its presence everywhere, in December last year, the company took the bold step of banning content generated by ChatGPT on their platform. In addition to attributing high inaccuracy as a reason for not allowing content from ChatGPT and other LLMs, the volume of answers that can be easily produced via GPT platforms which require a detailed review by an SME has burdened their volunteer-based quality curation infrastructure.
The company faced a 12% slump in the number of user visits – 247 million visits in December from 279 million in November. However, the company confirmed to AIM that it did not face any material impact to traffic and that the dip was a normal one brought about by a seasonal shift owing to holidays in December.
You cannot have the cake and eat it too
ChatGPT is believed to be trained on Stack Overflow data. GPT-3 paper that talks about training datasets includes Common Crawl which implies everything on the internet, including Stack Overflow.
For a company losing customers to another platform, owing to better and smoother experience for query resolution, it was only obvious that Stack overflow will next prevent ChatGPT from feeding on their datasets to better themselves or get paid – that’s pretty much what happened.
Stack Overflow, closely following Reddit, announced that it will charge AI developers for accessing its programming driven community questions. The company believes that community platforms that support LLM models should in some way be compensated for their contribution, which will help these companies reinvest back to grow the community.
The Internal Tussle
With initially blocking AI generated content on the platform leading to a number of users being suspended and posts being removed, Stack Overflow then went back to allowing AI content on the platform. With lack of uniformity, the company’s stance on AI-generated content came under scrutiny with curators, contributors and moderators participating in moderation strikes from June.
The moderators then came to an interim solution where AI-generated content will be checked against ‘strong’ and ‘weak’ heuristics, to serve as guidelines to evaluate a content’s quality and appropriateness. This would determine if a post needs to be removed or allowed on the platform. Furthermore, the moderators were able to convince Stack Overflow to continue providing data dumps and API access – something the company had been contemplating removing. Amidst the tussle, Stack Overflow finally announced generative AI.
What’s Ahead?
With Overflow AI, the company will be utilising their vast database consisting of over 58 million questions and answers from their community – a rich database exclusive to the company. Stack Overflow will integrate this database into IDEs (Integrated Development Environments) which are software applications that help developers write and manage codes. By this integration, the developers are facilitated with simpler coding resolution- akin to the multiple functions that ChatGPT’s Code Interpreter brings in.
While Stack Overflow is definitely late to the AI race, with players such as GitHub Copilot X having implemented GPT-4 in March itself, the vast database it brings might prove helpful for the company. Whether this would help bring back lost users, is something that needs to be seen.
The post Stack Overflow’s Bumpy Road to Gen AI Adoption appeared first on Analytics India Magazine.