Why Trusting AI is All a Matter of the Right Data at the Right Time

Why Trusting AI is All a Matter of the Right Data at the Right Time August 28, 2023 by Gaurav Rao, EVP and GM for AI/ML, AtScale

The world has grown accustomed to the presence of artificial intelligence (AI) in its daily lives. In fact, unless you’ve been asleep for much of 2023, you can see just how AI’s influence on the world is growing with the hype around generative AI. We all know AI has been used for years to recommend your next purchase, but it is also used by businesses looking to speed up analysis and make better, faster, more intelligent decisions. But what if that AI is unknowingly delivering bad decisions?

Sure, it's not a big deal when the AI is recommending a book that you're not interested in or a vegetable that you hate. But when it comes to financial guidance for investors, or new features for a featured product, an incorrect analysis can lead to a bad decision that will affect a business's success or worse, place individuals or groups at risk or harm.

This raises the question: Can we really trust AI to make important decisions? Especially, as more sophisticated deep learning and Generative AI models are being trained and deployed for everyday use.

The answer lies in understanding the pivotal roles data and data quality have in the machine learning lifecycle.

The Impact of Data Quality

When it comes to trusting AI, the quality of the data used to drive its decisions holds immense significance. Flawed data, whether incomplete, incorrect, or biased, can skew the accuracy of an AI's prediction. The consequences of relying on unreliable data in an AI system could potentially become catastrophic.

Imagine you're an investor seeking financial guidance and you turn to an AI-powered platform to assist you in making investment decisions. This AI relies on historical financial data to analyze market trends, identify potential opportunities, and provide recommendations. However, if the data used by the AI is incomplete or biased, perhaps because it was never fully integrated into the system, the guidance it offers may be flawed, leading you down the wrong investment path.

For another example, say the AI has access to financial data from only a limited number of sources, omitting crucial information from certain sectors or geographic regions. As a result, the AI's analysis may overlook significant market shifts or emerging trends, preventing you from capitalizing on lucrative investment opportunities.

Furthermore, consider the detrimental impact of biased data on the AI's decision-making process. Biases can seep into data collection methods or be inherent within the data itself, leading to skewed insights and recommendations. For instance, if the financial data used by the AI system predominantly represents a specific demographic or fails to account for diverse economic factors, the resulting investment recommendations may not reflect the realities of the broader market.

Relying solely on predictions based on incomplete or biased data can lead to misguided investment decisions, adversely affecting financial success and impeding business growth. The importance of data quality becomes glaringly evident when we recognize that the decisions made by an AI system can only ever be as reliable as the data available to it when training and deploying AI models. A classic example of “garbage in, garbage out.”

Just as a solid foundation is essential for the structural integrity of a building, reliable and high-quality data forms the bedrock upon which trustworthy AI can operate. Just as compromised building materials or a weak foundation can jeopardize the stability and safety of a structure, flawed data can undermine the accuracy and reliability of AI's decision-making.

To mitigate these risks, organizations must prioritize data collection processes that ensure completeness, accuracy, and lack of bias. They should strive to gather diverse and comprehensive datasets that encapsulate various sectors, demographics, and geographic regions. Employing data validation techniques, such as cross-referencing multiple sources and employing data cleaning algorithms, can help identify and rectify errors, ensuring a more accurate representation of reality.

The Importance of Robust Models

Equally crucial to data quality are the models employed to use the data for predictive insights. The reliability of AI-driven decisions heavily depends on the robustness, accuracy, and transparency of these models and on using the right data and data sets to build them.

Let’s imagine AI being used in the healthcare industry to assist doctors in diagnosing diseases. This AI model employs a complex neural network to analyze patient symptoms and provide diagnostic recommendations. While the neural network may exhibit high accuracy in identifying diseases, it lacks transparency in explaining how it arrived at its conclusions.

If you were a doctor using this prediction to aid in diagnosing a patient, you’d be able to receive a speedy diagnosis but have no understanding of the factors that contributed to that conclusion. Without transparency, it becomes challenging to trust the AI blindly, as it may overlook crucial symptoms or fail to account for important medical considerations. These challenges are common in highly regulated industries, where trust and transparency are requirements.

On the other hand, perhaps the healthcare AI tool utilizes a decision tree algorithm model instead. Decision trees are known for their interpretability, as they provide a step-by-step breakdown of the decision-making process. In this scenario, when the AI system recommends a diagnosis, you can easily trace the path of decision-making, understanding which symptoms and factors led to that conclusion. This transparency empowers a doctor to make a more informed judgment on the AI's recommendations.

For another example, imagine you are presented with two complex puzzles; one with clear instructions and transparent steps, while the other lacks any guidance or explanation. In the first puzzle, you can readily understand the logical progression, enabling you to solve it effectively. The second puzzle, however, leaves you puzzled and uncertain, making it difficult to trust your own decisions or ascertain if you've arrived at the correct solution. Guesswork, assumptions, and trial-and-error become part of the decision-making process. Robust and transparent models enable users to better comprehend the AI's decision-making process, instilling a greater sense of confidence in its recommendations.

The Role of the Semantic Layer

The semantic layer plays a crucial role in creating a common data layer that enhances the trustworthiness of AI decision making. It addresses the challenge of inconsistent data definitions and a lack of context, which as we’ve seen in the above examples, can dramatically derail AI utilization and trust. By establishing a single source of truth, a semantic layer can ensure that all AI applications are working from a common data source that has visibility into the transformation that the data has gone through each step of the way.

When different teams or individuals within an organization use different measurement units or have varying interpretations of data, it can hinder collaboration and result in conflicting conclusions. The semantic layer helps overcome this issue by providing a shared understanding. It captures the relationships, concepts, and context within the data, enabling consistent interpretation and analysis. With a common understanding of data across the organization, more trustworthy conclusions can be made, as they’re based on the same reliable source of information.

The semantic layer helps ensure that data is accurately understood, interpreted, and used consistently, fostering trust in the insights derived from AI systems.

Building Trust in AI

The real question is: How can we ensure that AI systems deliver trustworthy decisions?

To improve trust in AI, organizations must prioritize the development and utilization of models that exhibit robustness, accuracy, and transparency. This entails employing techniques such as explainable AI, where the inner workings of the model are made understandable and interpretable. Additionally, organizations can embrace model evaluation methodologies, leveraging techniques like sensitivity analysis or performance metrics to assess the reliability and effectiveness of the models. By embracing the creation of a semantic data layer, AI decision making can become more reliable, transparent, and informed because of the common source. Without these initiatives, AI will never be seen as a reliable and trustworthy decision-making partner.

Alongside meticulous scrutiny, asking the right questions is paramount in establishing trust with AI. By posing pertinent inquiries to any AI, we can ascertain its reliability in making critical decisions on our behalf.

Here are some important questions to consider:

  • What data is being used to train and guide the AI? Is it complete, accurate, and unbiased?
  • How are the models constructed for data analysis? Are they accurate, robust, and transparent?
  • How do the AI's decisions compare to those made by human experts as part of the feedback loop?

Only by ensuring the data is complete, accurate, and unbiased, and by utilizing accurate, robust, and transparent models, can we really begin to trust AI to make sound decisions. Trustworthy AI can become a catalyst for progress, but only if we take the necessary actions to help it evolve along the way. Generative AI will only magnify this challenge which is why a solid data foundation is crucial and more important now than ever before.

About the Author:

Gaurav Rao is the EVP and GM for AI/ML, leading all AI/ML strategy for AtScale. Prior to AtScale, Gaurav served as VP of Product at Neural Magic. Before Neural Magic, he served in a number of executive roles at IBM spanning product, engineering, and sales. He is also an advisor to data and AI companies.

Related

Meet Sam Altman’s Hyderabad Companion

When Sam Altman visited India, he was accompanied by two Indian-origin associates, namely Sandhini Agarwal and Atty Eleti, who saved Altman from answering policy and technical-related questions. AIM was lucky enough to get in touch with the latter, who now works as a software engineer at OpenAI, building APIs and other super cool stuff that he held himself from disclosing, even by mistake.

Athyuttam Eleti, aka Atty Eleti, joined OpenAI precisely when the company launched ChatGPT to the world. “I joined on November 7th, last year, two weeks before GPT was launched. I really thought I was joining a low key research lab. But it turns out, I was joining a consumer sensation. It’s been a really exciting ten months so far.” he shared, in an exclusive conversation with AIM.

He said that OpenAI just sort of happened to him. “I had always known about OpenAI. The company’s co-founder, Greg Brockman, was previously the CTO of Stripe, a payment processing company, where I worked, and some team-mates from Stripe joined OpenAI over the years.

Further, he said: What really got me was when my former roommate, Nicholas Turley, now the product manager at OpenAI, refused to talk about a project but would constantly say how ‘transformational,’ it was. “I did some research on GPT-3 and DALL.E and something clicked,” he said, saying that natural language interfaces are finally possible and there are hints of reasoning ability that insert intelligence in this model.

“And I can’t describe it any more than a gut feeling that this is the future. I applied and went through the due process and was lucky to get an offer,” he added.

The Interview

The interview process at OpenAI was extensive, “For the role I applied for, product engineer on the API team the interviews focused on product skills and engineering. There were multiple interview rounds at OpenAI,” he explained.

Eleti said one of them was a product sense interview, where he interacted with the designer, discussing the product philosophy, design flaws, and solutions, among other things. He said that the second interview was around systems design, where he had to design a backend for an app, like Slack or Uber.

He further explained that there was also a front-end interview, where he had to build a relatively complex UI. The interviewers closely examine the UI, focusing on details.

Additionally, he also said that there are broader career goals and behavioural interviews conducted by the hiring manager. “During this interview, they assess the candidate’s alignment with the company and reasons for wanting to work there,” added Eleti.

“I did the Stripe interview as a new graduate many years ago so they’re kind of different, but they were both hard!” he recalled, comparing his experience with the OpenAI interview.

Background

Eleti is a graduate of Brown University and is deeply interested in graphic design. While in university at Brown he took advantage of Rhode Island School of Design (RISD) and took classes on branding, typography, etc.

He wanted to be an animator at school and spoke about how he would experiment with Photoshop and 3D software on projects. “There’s always an interest in design and art but at RISD I tried to balance both of those things during my time in college, and in particular, was very interested in UI and UX design. And all my internships had the added component of that,” he added.

He interned at Facebook (now Meta), Kayak (a travel search engine platform), and Figma, as a design intern, before joining Stripe as a payment processing engineer.“I decided to double down and focus more on engineering as a full time career after my stints as a designer,” he said, explaining that he wanted to build the whole software product.

In January 2022, Eleti decided to venture into developing API software and enrolled into Y Combinator’s startup accelerator programme. “Six months into the endeavour, I realised that it wasn’t the right idea or the right time. I took some time off, it was during this period that I grew curious about OpenAI,” he added.

What’s next?

“My innings has only just begun and with OpenAI, I’m very, very excited to be a part of the change that’s about to happen in the human-computer interaction,” said Eleti.

He said he is excited to see newer models being launched, alongside building ambitious products and tools that enable developers. “I’m very interested in seeing how the journey plays out and see what kind of platform shift AI plays,” said Eleti, saying that eventually down the line would like to start his own company. “I am a builder at heart, and I want to keep building,” he concluded.

The post Meet Sam Altman’s Hyderabad Companion appeared first on Analytics India Magazine.

With Plunging WLD Token Prices, Where is Worldcoin Headed?

When Worldcoin was officially launched a month ago, people were thronging orb centres across the globe to get their irises scanned. The initial buzz that generated mayhem seems to be marred by a number of challenges now. The value of worldcoin has dropped by 46% since its launch and is continuously spiralling down. So, where exactly is Worldcoin headed?

Fading Hype

According to Santiment, a behaviour analytics platform for cryptocurrencies, the social volume and social dominance of worldcoin tokens (WLD) over the last 30 days has dropped by 95% and 74% respectively, indicating a decline in discussions and hype of the project.

The value of WLD which stood at $2.34 at the start of the month has now hit a low of $1.26. With the value dropping, all those who gained crypto tokens on successful iris scans, have lost out on market capitalisation.

Source: CoinDesk

Interestingly, the US did not launch Worldcoin which may have lost a large portion of global users. The country has the largest crypto users in the world with over 44.3 million users—13.7% of the country’s population own cryptocurrencies. Ironically, the country where crypto markets are governed by a number of regulatory bodies such as the Securities and Exchange Commision (SEC), the Commodity Futures Trading Commision (CFTC) and the Federal Trade Commission (FTC), did not find Worldcoin suitable.

In addition to offering crypto tokens (WLD) for every registration, and with the broader goal of providing universal basic income, Worldcoin was marketed as a proof of personhood product which essentially might serve as a digital identity in the future. As ambitious as the plans sounded, it was quickly entwined in regulatory and privacy concerns.

Internal Cracks

In a recent development, an alleged whistleblower from the organisation announced that he has cut all ties with the company and will help authorities conduct justified probes. The person released a video where he talks about what led to his exit but did not explain further owing to legal formalities. He emphasised on how he was aligned with the vision of the company but the execution was “horrendous.” He also termed the company’s white paper as deeply flawed.

In the last few weeks, the company has been going all out on bringing forth transparency in their operations. Worldcoin released pictures and explanations on the various parts of the orb – highlighting each component’s use. The organisation also shared documents on Privacy FAQs highlighting how World IDs and biometrics are used. There was also another document on entropy that spoke about the importance of iris scanning for proving personhood as opposed to faces and fingerprint scanning.

Through a series of such updates, Worldcoin has been desperately trying to prove its operations to be legit and safe. However, none of it reflected in people’s sentiments which is clearly visible from the downward trajectory of WLD price.

World ID is designed to enable anonymous use. Actions taken with World ID are not linked to a person's iris images or iris code, & third parties cannot link different actions through World ID data.

— Worldcoin (@worldcoin) August 24, 2023

Countries Rebel

With countries raising the alarm on Worldcoin operations, a number of them opened investigations against it. Kenya was the first country to ban Worldcoin operations citing privacy concerns. In Nairobi, local media reported Kenyan police to have conducted a search warrant where documents and machines belonging to Worldcoin were seized. The government has thus formed a 15-member parliamentary committee to investigate the same and has a 42-day window to investigate and submit a report to the House committee.

Traversing from Africa, regulatory action hit South America too. Two weeks ago, Argentina opened up investigations against Worldcoin. The Agency for Access to Public Information (AAIP) which serves as the data overseer in the nation will probe into Worldcoin’s handling of personal data.

The EU known for watchful scrutiny was one of the first ones to raise concerns. France, Germany and the UK had opened up investigations right after Worldcoin was launched, however no ban has been placed on its operations. Though no form of scrutiny was raised against in India, orb operations were shut in all three centres in Bengaluru.

Despite the fall, it may be too early to seal the fate of Worldcoin. Considering how crypto markets are highly volatile, and it has only been a month since its official launch, it is possible that things can change. However, if the trend continues with WLD losing its market cap, and people losing interest, redemption may be a tough task.

The post With Plunging WLD Token Prices, Where is Worldcoin Headed? appeared first on Analytics India Magazine.

Creating A Simple Docker Data Science Image

Creating A Simple Docker Data Science Setup
Image created by Author with Midjourney Why Docker for Data Science?

As a data scientist, having a standardized and portable environment for analysis and modeling is crucial. Docker provides an excellent way to create reusable and sharable data science environments. In this article, we'll walk through the steps to set up a basic data science environment using Docker.

Why is it we would consider using Docker? Docker allows data scientists to create isolated and reproducible environments for their work. Some key advantages of using Docker include:

  • Consistency — The same environment can be replicated across different machines. No more "it works on my machine" issues.
  • Portability — Docker environments can easily be shared and deployed across multiple platforms.
  • Isolation — Containers isolate dependencies and libraries needed for different projects. No more conflicts!
  • Scalability — It's easy to scale an application built inside Docker by spinning up more containers.
  • Collaboration — Docker enables collaboration by allowing teams to share development environments.

Step 1: Creating a Dockerfile

The starting point for any Docker environment is the Dockerfile. This text file contains instructions for building the Docker image.

Let's create a basic Dockerfile for a Python data science environment and save it as 'Dockerfile' without an extension.

# Use official Python image  FROM python:3.9-slim-buster    # Set environment variable  ENV PYTHONUNBUFFERED 1    # Install Python libraries   RUN pip install numpy pandas matplotlib scikit-learn jupyter    # Run Jupyter by default  CMD ["jupyter", "lab", "--ip='0.0.0.0'", "--allow-root"]

This Dockerfile uses the official Python image and installs some popular data science libraries on top of it. The last line defines the default command to run Jupyter Lab when a container is started.

Step 2: Building the Docker Image

Now we can build the image using the docker build command:

docker build -t ds-python .

This will create an image tagged ds-python based on our Dockerfile.

Building the image may take a few minutes as all the dependencies are installed. Once complete, we can view our local Docker images using docker images.

Step 3: Running a Container

With the image built, we can now launch a container:

docker run -p 8888:8888 ds-python

This will start a Jupyter Lab instance and map port 8888 on the host to 8888 in the container.

We can now navigate to localhost:8888 in a browser to access Jupyter and start running notebooks!

Step 4: Sharing and Deploying the Image

A key benefit of Docker is the ability to share and deploy images across environments.

To save an image to tar archive, run:

docker save -o ds-python.tar ds-python

This tarball can then be loaded on any other system with Docker installed via:

docker load -i ds-python.tar

We can also push images to a Docker registry like Docker Hub to share with others publicly or privately within an organization.

To push the image to Docker Hub:

  1. Create a Docker Hub account if you don't already have one
  2. Log in to Docker Hub from the command line using docker login
  3. Tag the image with your Docker Hub username: docker tag ds-python yourusername/ds-python
  4. Push the image: docker push yourusername/ds-python

The ds-python image is now hosted on Docker Hub. Other users can pull the image by running:

docker pull yourusername/ds-python

For private repositories, you can create an organization and add users. This allows you to share Docker images securely within teams.

Step 5: Loading and Running the Image

To load and run the Docker image on another system:

  1. Copy over the ds-python.tar file to the new system
  2. Load the image using docker load -i ds-python.tar
  3. Start a container using docker run -p 8888:8888 ds-python
  4. Access Jupyter Lab at localhost:8888

That's it! The ds-python image is now ready to use on the new system.

Final Thoughts

This gives you a quick primer on setting up a reproducible data science environment with Docker. Some additional best practices to consider:

  • Use smaller base images like Python slim to optimize image size
  • Leverage Docker volumes for data persistence and sharing
  • Follow security principles like avoiding running containers as root
  • Use Docker Compose for defining and running multi-container applications

I hope you find this intro helpful. Docker enables tons of possibilities for streamlining and scaling data science workflows.

Matthew Mayo (@mattmayo13) is a Data Scientist and the Editor-in-Chief of KDnuggets, the seminal online Data Science and Machine Learning resource. His interests lie in natural language processing, algorithm design and optimization, unsupervised learning, neural networks, and automated approaches to machine learning. Matthew holds a Master's degree in computer science and a graduate diploma in data mining. He can be reached at editor1 at kdnuggets[dot]com.

More On This Topic

  • An Overview of Mercury: Creating Data Science Portfolio and Notebook Based…
  • Docker for Data Science Cheat Sheet
  • Winning The Room: Creating and Delivering an Effective Data-Driven…
  • Creating Good Meaningful Plots: Some Principles
  • Creating a Web Application to Extract Topics from Audio with Python
  • The Challenges of Creating Features for Machine Learning

Automate Any Task in Google Sheets With SheetGPT, Now $48.99

A businessman using a spreadsheet program on a laptop.
Image: StackCommerce

Google Sheets is a great platform for working with data, particularly in remote teams. However, it has all the same limitations as any other spreadsheet editor. That is, unless you sprinkle in a little AI magic via SheetGPT.

This top-rated tool lets you automate a variety of tasks using ChatGPT, and it’s really easy to use. You would normally pay $299 for lifetime access, but you can grab the app today for only $48.99 via TechRepublic Academy.

From crunching sales data to crafting a content marketing strategy, we all use spreadsheets to organize data. It’s a system that works well — but you do have to spend time entering, importing and formatting new information.

What if you could automate these processes? You could spend many more hours doing something more productive in your business.

SheetGPT opens up this possibility. Once you install this add-on and hook up your OpenAI account, you can access an AI assistant whenever you want.

For instance, you can use SheetGPT to summarize a document in a few words or write a draft article. The app can also generate social media content, categorize data, classify information and cleanse anything that is unusable.

Using SheetGPT is a bit like creating a formula. You simply select a cell and type =AI(“Your Prompt Here”). Almost instantly, you get a response from ChatGPT that fills the cell.

You can also use a single prompt across multiple cells and type in a URL to reference online content. In the context of marketing, sales and data analysis, it has the potential to save a huge amount of time and effort.

This deal offers lifetime access for one user, with unlimited usage (you just need an OpenAI account). SheetGPT is rated 5/5 stars on AppSumo and SourceForge.

Get a SheetGPT Single User Plan: Lifetime Subscription for only $48.99 (reg. $299).

Prices and availability are subject to change.

Subscribe to the Innovation Insider Newsletter

Catch up on the latest tech innovations that are changing the world, including IoT, 5G, the latest about phones, security, smart cities, AI, robotics, and more.

Delivered Tuesdays and Fridays Sign up today

Google Teaches ChatGPT How to Solve Math Problems

AI chatbot ChatGPT might excel in tasks like script writing, explaining complex topics, debugging, code explaining and others, but it performs really poorly in maths.

Stanford University and University of California, Berkeley recently published a research paper and stated that large language models (LLMs) can perform simple maths operations when numbers are small, but struggles with large numbers suggesting that LLMs have not learned the underlying rules needed to perform these arithmetic operations. It further mentioned, even with GPT-4 improvements on the MATHS dataset, errors largely occur due to arithmetic and calculation mistakes.

The rival company Google has acknowledged the issue in LLMs and stepped in to teach models akin to ChatGPT to reason better algorithmically.

The work by Google researchers titled, ‘Teaching language models to reason algorithmically’, takes in the in-context learning approach and introduces an algorithm better at reasoning.

In-context learning is teaching a new skill where the researchers guide it through the process step-by-step instead of overwhelming it with all instructions upfront. The method refers to a model’s ability to perform a task after seeing a few examples of it within the context of the model.

They also presented a prompting technique for general purpose language models to have strong generalisation on maths problems more difficult than the ones in prompt. The technique builds upon other rationale-augmented approaches (e.g., scratchpad and chain-of-thought). Lastly, they demonstrated that a model can ‘reliably execute algorithms on out-of-distribution examples with the right prompts.

A Below Average Student

ChatGPT has become worse at performing certain basic maths operations — as it is getting better at other things. The same study highlighted that the high profile chatbot is getting worse as compared to its performance earlier in March.

Researchers said the deterioration is due to an AI phenomenon known as drift, where attempts to improve one part of complex models make other parts of the models worse.

To track performance, James Zou, Stanford professor affiliated with the school’s AI lab and his colleagues, Matei Zaharia and Lingjiao Chen fed ChatGPT 1,000 different numbers. In March, the paid GPT-4 version impressively identified whether 84% of the numbers were prime or not. By June the success rate dropped to 51%.

Apart from getting the answers wrong, ChatGPT also got a thumbs down for its attempt to show the researchers how it arrived at certain conclusions. As part of the research the researchers additionally asked the chatbot to lay out its “chain of thought,” the term for when a chatbot explains its reasoning. In March, it did so, but by June it stopped showing its step-by-step reasoning.

The recent Google study tries to tackle this issue with its in-context learning approach. These discoveries suggest that exploring longer contexts, and prompting more informative explanations could provide valuable research.

The Wolfram Headmasters

A pioneer in fusing technology with maths education, Wolfram Research has been working with ChatGPT’s parent OpenAI, to bring better maths capabilities in AI models. ”We have seen some interesting results with our LLM. I tried to run a British ‘A’ level maths, an exam students take before University, and ChatGPT alone got 43% which is quite impressive, but Wolfram plus ChatGPT got 96%,” cofounder of the company, Conrad Wolfram revealed in an interview with AIM.

“Game over for humans on that one”, he quipped.

Notably, when a same maths teaser was thrown at ChatGPT version 3.5, 4, and Wolfram Plugin — what is the smallest integer greater than 95,555, in which there will be 4 identical numbers? — only the latter got it right in the first attempt.

The Wolfram + ChatGPT plugin not only solves maths step-by-step but it can also present them visually if specifically prompted to do so. Based on the prompts, it can go a step further and represent the data in different forms like graphs, charts, and histograms.

The plugin can turn queries in natural language into beautiful mathematical equations. It can do so since it combines ChatGPT’s human mimicking technology and Wolfram’s strong foundation of symbolic programming language that focuses on expressing ideas in a computational form.

On one hand,Wolfram is making strides with its plug-in and on the other, researchers show models performance worsening. In the current landscape, Google’s latest in-context learning approach can help AI chatbots become an above-average student.

The post Google Teaches ChatGPT How to Solve Math Problems appeared first on Analytics India Magazine.

What’s Brewing after Temenos Partners with MongoDB?

In May, banking software giant Temenos released impressive performance results for its Temenos Banking Cloud platform that were achieved using Microsoft Azure and MongoDB Atlas infrastructure.

Founded in 1993, Geneva-based Temenos has become a global leader in the fintech industry, offering a wide range of software products and services tailored to the needs of financial institutions.

The Temenos Banking Cloud demonstrated remarkable scalability, managing 200 million embedded finance loans and 100 million retail accounts, achieving 150,000 transactions per second. It supports banks’ growth via BaaS or independent product distribution, excelling in core transactions, payments, security, data, and digital channels. Collaborating with Microsoft and MongoDB, the test highlighted Temenos’ adaptable platform for high BaaS transaction volumes across multiple brands.

AIM caught up with Wei You Pan, Principal, Financial Industry Solutions, MongoDB and Ganesan Sriraman, EVP, Product Engineering, Temenos to understand the importance of the results.

“MongoDB’s architecture was used to boost the flexibility, scalability, and security of the Temenos banking platform in several ways,” said Pan. The schema flexibility of MongoDB’s document model allows for easy adaptation of business requirements across different customers, countries, and regions. This is crucial for the dynamic needs of the banking industry.

Soaring Global Payments

Temenos’ single platform caters to different banks, particularly those with larger and more diverse businesses dealing with extensive and complex data processing demands in today’s hyper-digitalised banking landscape. This solution aligns with the industry-wide challenges faced by financial institutions, addressing operational data capture and processing needs.

Similarly, MongoDB, like Temenos, caters to global financial institutions and banks, supporting on-premise, cloud-based self-managed, and SaaS application deployments.

“Through MongoDB, Temenos facilitates its customers in deploying applications across these deployment modes, including AWS, Azure, and Google Cloud through MongoDB Atlas,” said Sriraman of Temenos.

This multi-cloud approach is essential for accommodating customer preferences. Notably, MongoDB Atlas allows for the creation of multi-cloud clusters, even distributing nodes across different cloud service providers within a single cluster to enhance reliability and regulatory compliance.

For example, Temenos’ real-world impact involves a global payment provider launching an “Buy Now Pay Later” embedded lending product on the Temenos Banking Cloud, achieving exceptional scalability and serving 200 million loans across numerous countries in just over three years in line with the demands of the constantly available “moment economy” and the robustness required in the realm of embedded finance.

Keeping Sustainable at Core

“We reinforce our commitment towards sustainability through our products, like the carbon emissions calculator on Temenos Banking Cloud for banks’ net zero goals,” added Sriraman.

This independently verified solution is embedded into the Temenos Banking Cloud and offered at no cost to customers, who can benefit from over 90% in carbon emissions savings compared to on premise IT infrastructures and applications. It gives clients deeper insight into their carbon emissions data, allowing them to track progress toward their sustainability targets.

Furthermore, Temenos core banking product has become over 30% more carbon efficient in the last 12 months with more year-on-year improvements underway.

Tech Stack

“We have embraced innovative technologies, particularly Explainable AI, over the past two years,having a significant impact on businesses and clients in the banking sector,” said Sriraman.

With their Explainable AI capabilities, the company aims to offer transparent AI decision-making that can be easily understood by customers and regulators. These capabilities are integrated into various aspects of their solutions, including wealth management, anti money laundering, credit scoring, customer management, and more. According to him, the incorporation of AI and machine learning has led to the creation of explainable models that enhance customer experiences and automate processes.

Temenos’ open platform enables clients to design and deliver digital experiences using low code or no code methods. By decoupling new banking functionalities from the underlying technology, Temenos ensures a rich feature set while employing modern and open technology. They offer a unified code base, ensuring that every technological investment benefits all clients. Through the adoption of open APIs and event-driven microservices, Temenos allows banks to integrate the latest technology while leveraging established functionality from over 150 countries. The platform provides flexibility in deployment, accommodating on-premise, private or public cloud, and SaaS solutions through the Temenos Banking Cloud.

MongoDB’s Role in the Growth of BaaS

“The surge in Banking-as-a-Service (BaaS) adoption can be attributed to the adoption of Open Banking principles by global regulators, fostering growth driven by regulations and market forces, allowing financial institutions to expand their services through Third-Party Providers (TPP), overcoming challenges related to data privacy and compliance,” said Pan.

BaaS offers banks the opportunity to reach a wider customer base through TPP channels at a reduced cost. Regulatory support and the acceleration prompted by the pandemic have been key drivers of BaaS, while technology, particularly API-focused collaborations and advanced data management, has facilitated its adoption.

He further added regulatory backing has not only enabled BaaS but also provided financial institutions with avenues for growth via TPPs. This has been crucial in surmounting data privacy and compliance obstacles, which could have hindered BaaS adoption. For banks, BaaS offers a chance to access a broader customer base through TPPs at a lower cost compared to traditional methods.

The Covid-19 pandemic has acted as a catalyst, accelerating the adoption of digital banking and further fueling the rise of BaaS. According to Pan, technology has played a vital role, with API-based collaborations and data management supporting BaaS interactions. The prevalence of APIs in BaaS requires banks to be highly scalable, a challenge addressed by advancements in public cloud technology. This technology offers on-demand resource scaling and cost optimisation, enhancing banks’ ability to handle increased TPP activity effectively.

“The partnership between MongoDB and Temenos ensures that such advancements are effectively integrated and leveraged,” concluded Pan. On the horizon, MongoDB’s developments include an innovative vector search feature for AI integration, a collaborative AI initiative with Google Cloud using Vertex AI, and the introduction of MongoDB Atlas tailored for the financial services sector.

Read more: MongoDB Ups the Ante with Vector Search for Generative AI

The post What’s Brewing after Temenos Partners with MongoDB? appeared first on Analytics India Magazine.

Ideogram Just Dropped a Big Revelation for Midjourney

Since its release, every day, around a lakh people on the internet use Midjourney, the AI art generating tool for the first time. But the fame is fading out dramatically. The first major impact came when the platform stopped free trials. Another reason for the loss of fame is that the platform is not intuitive to use as it can only be accessed through Discord. The task of jumping through all the hoops from ‘getting Discord, joining the beta, signing up, and then landing on the Discord server’ is repelling for users.

It’s been more than a year since the launch of Midjourney and the tool still does not give users the option like others to directly sign up, pay, prompt and get results. Users have long sought a custom UX with a prompt builder along with pull-down menus (to choose style, render quality, etc.).

While users were trying to ease the pain-in-neck process on Midjourney through bots, a few former Googlers noticed the uncharted territory.

This month, the team introduced Ideogram AI, a text-to-image tool similar to Midjourney.

While the market is already dominated by Midjourney, Adobe’s Firefly, and OpenAI’s Dalle-2, et al. users claim Ideogram’s model has reliable text generation, which could give the new entrant an edge for generating logos. In fact, Ideogram’s ‘superfast powerful brain going to the right’ logo was also generated by the platform!

Did you know our logo is a super fast powerful brain going to the right?
Do you have a guess who generated the logo? pic.twitter.com/yHpBxNfynX

— Ideogram (@ideogram_ai) August 25, 2023

Nevin Thomas, creative head at AIM experiments with such AI tools regularly. His LinkedIn post based on his latest experiment with Ideogram reads, “Where one AI tool fails another shines! #Midjourney is miles better than Ideogram (at this stage at least) when it comes to generating images from text prompts but Ideogram seems to be ahead when it comes to putting text on the image.” He prompted Midjourney and Ideogram with the same text description — Astronauts at a space station holding a sign that says, “WE ARE HERE”, photo.

The result shows a startling difference between the two AI-powered platforms in understanding the prompts. Moreover, Ideogram runs in the browser and directly integrates social media features making it easier for users like Thomas to explore tools alike.

Developed by three former Google research scientists — Chitwan Saharia, William Chan, and Jonathan Ho, who have worked on the tech goliath’s AI projects such as Imagen, Google’s own text-to-image system. The team launched the platform backed by A16z and others with $16.5m in seed financing to develop state-of-the-art tools.

The (Legal) Rabbit Hole

Artificial intelligence has been having a tryst with art for quite some time. From restoring Klimt’s legendary ‘Trio’ to Midjourney’s game-changing generative AI fill the art-tech community has come a long way within a few years. While AI art has encouraged users to create 15+ billion images within months, the tools are constantly being raised fingers at by ethicists and art connoisseurs.

For instance, Sarah Andersen, Kelly McK­er­nan, and Karla Ortiz dragged Midjourney, Deviant Art and Stability.AI to court for using their work without consent. Even though the heap of accusations has been lengthening, the laws are in favour of these tools.

Welp, didn’t take long for DALL-E to pivot to paid model and DOR has summed up pretty succinctly how that’s a problem pic.twitter.com/8NLSwS437L

— Dan Kelly (@dananthonykelly) July 22, 2022

As a result, people like David Holz, founder of Midjourney “don’t really wanna be involved in” the plagiarism issues that haunt the internet. Pointing at his relaxed view on data theft, John Oliver quipped, saying, “I am not really surprised. He looks like hipster Willy Wonka answering a question on whether importing Oompa Loompas makes him a slave owner.”

While all sorts of thorny legal questions are being raised at these dreamy art generators, Adobe is confident that its Firefly won’t breach any copyright laws. The design software provider is so sure that it guarantees to compensate businesses if they’re sued for infringement over any image its tool creates.

For its part, Ideogram states that it has a focus on creativity with a “high standard for trust and safety.” The latest contender has become successful overnight as users are already testing typography with its skills. Moreover, OpenAI’s Andrej Karpathy and Cohere’s co-founder Nick Frosst are some of the first followers of the tool’s social media page. While the tool has just gotten into the hands of users, we wait to see how long it works in Ideogram’s favour.

The post Ideogram Just Dropped a Big Revelation for Midjourney appeared first on Analytics India Magazine.

Why NVIDIA Partnered with VMware

Last week, at its Explore event, VMware introduced Private AI for businesses to develop on-premise generative AI, in their own data centres rather than the cloud. Private AI brings AI models to where data is generated, processed, and consumed. At the same event, the company also announced its partnership with NVIDIA.

The partnership aimed to enable enterprises to customise models and run generative AI applications, including intelligent chatbots, assistants, search and summarisation. The platform will be a fully integrated solution featuring generative AI software and accelerated computing from NVIDIA, built on VMware Cloud Foundation and optimised for AI.

VMware CEO, Raghu Raghuram, said, “With VMware Private AI, we are empowering our customers to tap into their trusted data so they can build and run AI models quickly and more securely in their multi-cloud environment.”

This looks like a great opportunity for VMware, but what’s in it for NVIDIA?

With this partnership, NVIDIA plans to diversify its own services—which comes as some of its biggest buyers are designing their own chips to eventually manage the growing demands for data crunching in AI. Google, for instance, designs its own AI chips now and as AIM predicted Microsoft is following suit with the development of its Athena chips. This would inevitably lead to lesser demands for NVIDIA’s chips in the long term.

Despite its hardware origins, NVIDIA looks committed to a slow but steady march towards cloud offerings. In 2020, the company acquired ‘Mellanox Technologies’, a cloud computing company for a whooping USD 6.9 billion.

The partnership comes on the heels of NVIDIA’s launch of ‘AI Foundations’, a new range of cloud services marked individually for different functions, especially within the realm of generative AI. It was merely a hint that NVIDIA was stepping foot outside of its GPU ring, and with its recent partnership with VMware NVIDIA has made it very evident.

Mutually Rewarding

Along with NVIDIA finding a new avenue for diversification, VMware would benefit handsomely. The company would find application across industries and widespread adoption as this collaboration aims to facilitate compatibility with companies like Dell, Lenovo, HPE, and others, enabling customers to easily deploy the solution according to their requirements.

“Using Nvidia AI software as well as Nvidia AI hardware, how do we almost make it like an appliance that you can work with Dell and Lenovo and HPE and others so customers can deploy it wherever they need to?” said VMware CEO Raghu Raghuram.

The partnership between NVIDIA and VMware doesn’t seem to be rushed, rather well thought out.

Jensen Huang also mentioned VMware again and again during NVIDIA’s blockbuster earnings call. With 15 mentions VMware was referred to as the go-to partner for NVIDIA above giants like Google or Apple in its cloud effort. So the partnership is not out of the blue and is mutually beneficial.

Additionally, NVIDIA is providing VMware with new computing capabilities. Jensen announced a breakthrough, enabling VMware to achieve bare metal performance while maintaining security, manageability, and V motion capabilities across various GPUs and nodes.

“GPUs are in every cloud or on-premise servers everywhere. And VMware is everywhere. And so for the very first time, enterprises around the world will be able to do private AI. Private AI at scale, deployed into your company and know that it’s fully secure and multi-platform,” Jensen said.

This would see increased and optimised usage of NVIDIA GPUs. The chip giant in turn is helping sell their vision of Private AI like it was nobody’s business.

So much so that the uber cool jacketed NVIDIA CEO called himself “the best sales guy” and in turn got a nice chuckle from Raghuram as an approval of his salesmanship.

Benefits For The Ecosystem

Collectively the partnership presents holistic solutions for the ecosystem. Its approach addresses privacy concerns, provides model flexibility, facilitates scalability, and optimises cost efficiency.

One of the main issues that this offering provides resolution to is enterprises’ uncertainty over privacy and security.

It could also bring relief to data centre costs and address the GPU crunch during this AI gold rush. By harnessing the full potential of computing resources encompassing GPUs, DPUs, and CPUs through virtual machines, the Private AI Foundation ensures optimal resource utilisation, ultimately translating into reduced overall costs for enterprises.

Flexibility emerges as another critical advantage.

“For any meaningful enterprise, their data lives in all types of locations, distributed computing and multi-cloud will be at the very foundation of AI, there’s no way to separate these two”, said VMware CEO Raghu Raghuram.

The platform offers businesses the freedom to choose where they build and deploy their models, ranging from the NVIDIA NeMo framework to versatile options like Llama 2 and beyond. This flexibility streamlines the model development process, enhancing adaptability to specific enterprise needs.

Conclusively, the VMware-NVIDIA partnership ushers in a new era of on-premise generative AI, reshaping the AI landscape and presenting a mutually rewarding collaboration that benefits both companies and the broader ecosystem.

The post Why NVIDIA Partnered with VMware appeared first on Analytics India Magazine.

Former Google DeepMind Researchers Go Deep for Sales Triumph

With the rush to create prototypes using large language models and releasing it on Hacker News, where none of them have seen integrated use cases, a startup is working on offering solutions that are feasible. Plunging right into a market where LLMs can be leveraged for sales solutions, which is projected to reach $770M by 2032, are former Google DeepMind employees through their startup Glyphic AI– an AI copilot for the sales team.

“I think one of the biggest challenges of Glyphic, as well as any LLM- based project, is handling hallucinations,” said Devang Agrawal, cofounder and CTO of Glyphic AI, in an exclusive interview with AIM. “One of the advantages of Glyphic is that, since most of our tasks are quite grounded, i.e. we are not just answering questions randomly, we’re trying to go through questions, calls, emails, and things like that, the scope of hallucinations is minimised. The answer is always based on something.”

In June, Glyphic AI officially came out of stealth mode and raised a pre-seed funding of $5.5M.

Navigating Large Language Models

“I think ChatGPT has brought to everyone’s mind the capabilities of AI, and everyone is now thinking about how it could be useful within their particular use case,” said Agrawal.

Speaking of other challenges with LLM – “Most people are using large language models these days with some prompt engineering, but, I think what people are going to realise is that this approach is not that scalable. You can have evaluation sets or have some tests, but it is very hard to know if you’re getting better or worse, and the sort of iterating on large language models is very challenging. For instance, by changing your prompt, you can completely break everything but you won’t truly be able to detect what broke. You can trip the model and the accuracy can come down, but it’s hard to test these models.”

At Glyphic, a mixture of few small models and large language models are used. “We kind of cleverly decide between GPT-4, Claude or Cohere based on the context and tasks we are trying to accomplish. For instance, Claude is able to understand long context and can understand 100,000 tokens in one go. This is something you can’t do with GPT-4 which can understand only up to 16,000 tokens. Claude is also trained in a much conversational way, whereas GPT-4 can be direct. So, based on the sort of context, we use one or the other model depending on what behaviour we actually want.”

Glyphic AI will sufficiently validate its product and use cases, and then move towards optimising it. After which, in the future, the company would work towards building their own language model.

Agrawal also spoke about a recent trend that is extensively surfacing. “What we’re seeing is that everyone is building these pretty-looking prototypes with large language models and putting it on Hacker News. While it looks nice, we still haven’t seen deeply integrated use cases, which are of high quality, high fidelity, and are being used everyday – because it is really challenging to do so.”

DeepMind Harvesting Entrepreneurs

Glyphic AI Co-founders Devang Agrawal and Adam Liska

Agrawal completed his graduation from Cambridge University and had always been fascinated with AI. He worked as a machine learning engineer at Apple on the Siri project. From a product-focused role, wanting to move onto a research and academic side of things, made Agrawal join a research-heavy environment. He thus joined as a research engineer at Google DeepMind where he worked for two years before moving on to start Glyphic AI with another DeepMind senior researcher Adam Liska.

“DeepMind is one of the most innovative companies, and one of those few places where you can be working for years and still be learning so much, because you are always on the forefront. It was a really hard decision to leave as the multimodal project that we were doing was going in a really exciting direction,” said Agrawal. “But, Adam and I were always clear about wanting to build a startup. From my college days I knew what I wanted, and was doing jobs to get the correct skills so when you do have a startup, it becomes more effective.”

Google DeepMind also supports people by giving them the flexibility to work on projects that they desire and even allows them to take risks. “It is a very supportive organisation where you can take speculative bets which sometimes work out and sometimes don’t. It is a great platform to develop as researchers and help us kind of inculcate this critical thinking which is very helpful for research.”

Agrawal believes that having expertise and being in the centre of transformational technology helped with the switch. “Many people are leaving DeepMind to set up startups, as they have expertise in this new transformational technology, which is now ready to be used in a product.”

There were two waves of exits that were fueled by technology change at Google DeepMind. “When reinforcement learning had come to the market, a number of people were leaving DeepMind to set up startups using the same, but what we realised was that it was really hard to bring deep reinforcement learning in a product context, and that wave slightly died out. This was around 2017. Now, with people having specialist skills in large language and multimodal models, people want to build products on it.”

A number of people have also gone on to build research-driven startups to solve large problems, such as cancer. “Designing new molecules that might work better for treating certain sorts of cancer, etc. is a different sort of startup that requires a different sort of thinking. We’re just focusing on slightly more open-ended things and solving the problem in a fundamental way.”

Roadmap For Glyphic

Currently, with ten employees, Glyphic focuses on applying large language models and generative AI to transform B2B sales processes. “Right now we’re just focusing on improving and optimising sales processes but we want to build on top of it to optimise the entire go-to market and product strategy with everything else.” Looking at the company’s vision in a year’s time, Agrawal looks to have a research team. “As we grow and scale, we would invest deeply into research and this would be one of the key differentiators.”

Glyphic currently provides services for software companies. “They’re so innovative with their sales processes and are willing to try out new products. Once we prove our products and models, we can expand into enterprises.” Glyphic also works with a few Indian companies which are headquartered or registered in the US.

“Microsoft is building something in this area, and so are other companies which are building copilot for sales, but I think this is one of the spaces where startups have a huge advantage. The technology is moving fast and you need to be able to completely rip out everything and change it within a couple of weeks if you want to stay ahead of the game. I think we definitely have an advantage because of our kind of technology background.”

The post Former Google DeepMind Researchers Go Deep for Sales Triumph appeared first on Analytics India Magazine.