Your Ultimate Guide to Chat GPT and Other Abbreviations – KDnuggets
ML (machine learning) is an approach to solving difficult computational problems instead of coding using a programming language you build an algorithm that learns the solution from data samples.
AI (artificial intelligence) is a field of computer science dealing with problems (e.g., image classification, working with human language) that are difficult to solve using traditional programming. ML and AI go hand in hand, with ML being a tool to solve problems formulated in AI.
AGI (artificial general intelligence) - is the correct term for what popular culture usually implies by AI the ability of computers to achieve human-like intellectual capabilities and broad reasoning. It is still the holy grail for researchers working in the AI field.
An artificial neural network (ANN) is a class of ML algorithms and data structures (or models for short) so called because it was inspired by the structure of biological neural tissue. But this doesnt completely mimic all the biological mechanisms behind it. Rather, ANNs are complicated mathematical functions that are based on ideas from living species biology.
Neural networks are layered structures consisting of uniform units interconnected with each other in a network. The way these units are interconnected is called architecture. Each connection has an associated number called weight and the weights store information the model learns from data. So, when you read the model has 2 billion parameters, it means that there are 2 billion connections (and weights) in the model, and it roughly designates the information capacity of the neural network.
Neural networks have been studied since the 1980s but made a real impact when the computer games industry introduced cheap personal supercomputers known as graphical processing units (GPUs). Researchers adapted this hardware for the neural network training process and achieved impressive results. One of the first deep learning architectures, the convolutional neural network (CNN), was able to carry out sophisticated image recognition that was difficult with classical computer vision algorithms. Since then, ML with neural networks has been rebranded as deep learning, with deep referring to the complicated NN architectures the networks are able to explore.
Id recommend videos by Grant Sanderson available on his animated math channel.
To work with human language using computers, language must be defined mathematically. This approach should be sufficiently generic to include the distinctive features of every language. In 2003 researchers discovered how to represent language with neural networks and called it the neural probabilistic language model or LM for short. This works like predictive text in a mobile phone given some initial sequence of words (or tokens), the model can predict the next possible words with their respective probabilities. Continuing this process using previously generated words as input (this is autoregression) the model can generate text in the language for which it was trained.
Representing sequences of items was a challenging problem for neural networks. There were several attempts to solve the problem (mostly around variations of recurrent neural networks), which yielded some important ideas (e.g., word embedding, encoder-decoder architecture, and attention mechanism). In 2017 a group of Google researchers proposed a new NN architecture that they called a transformer. It combined all these ideas with effective practical implementation. It was designed to solve the language translation problem (hence the name) but proved to be efficient for capturing the statistical properties of any sequence data.
OpenAI experimented with transformers to build a neural probabilistic language model. The results of their experiments are called GPT (generative pre-trained transformer) models. Pre-trained means they were training the transformer NN on a large body of texts mined on the Internet and then taking its decoder part for language representation and text generation. There were several generations of GPTs:
Given the enormous number of parameters GPT models have (in fact, you need a huge computational cluster with hundreds to thousands of GPUs to train and serve these models), they were called Large Language Models (LLMs).
The original GPT-3 is still a word prediction engine and thus is mostly of interest to AI researchers and computational linguists. Given some initial seed or prompt, it can generate text infinitely, which makes little practical sense. The OpenAI team continued to experiment with the model, trying to fine-tune it to treat prompts as instructions to execute. They fed in a large dataset of human-curated dialogues and invented a new approach (RLHF reinforcement learning from human feedback) to significantly speed up this process with another neural network as a validator agent (typical in AI research). They released a model called InstructGPT as an MVP based on a smaller GPT-3 version and in November 2022 released a full-featured version called ChatGPT. With its simple chatbot and web UI, it changed the IT world.
Given that LLMs are just sophisticated statistical machines, the generation process could go in an unexpected and unpleasant direction. This type of result is sometimes called an AI hallucination, but from the algorithmic perspective, it is still valid, though unexpected, by human users.
Raw LLMs require treatment and additional fine-tuning with human validators and RLHF, as previously mentioned. This is to align LLMs with human expectations, and not surprisingly the process itself is called alignment. This is a long and tedious procedure with considerable human work involved; this could be considered LLM quality assurance. The alignment of the models is what distinguishes OpenAI/Microsoft ChatGPT and GPT-4 from their open-source counterparts.
Neural networks are black boxes (a huge array of numbers with some structure on top). There are some methods to explore and debug their internals but the exceptional generalization qualities of GPTs remain unexplained. This is the main reason behind the ban movement some researchers think we are playing with fire (science fiction gives us fascinating scenarios of AGI birth and technological singularity) before we get a better understanding of the processes underlying LLMs.
The most popular include:
GPTs are the most mature models with API access provided by OpenAI and Microsoft Azure OpenAI services (if you need a private subscription). But this is the frontier of AI and many interesting things have happened since the release of ChatGPT. Google has built its PaLM-2 model; Meta open-sourced their LLaMA models for researchers, which spurred lots of tweaks and enhancements (e.g., Alpaca from Stanford) and optimization (now you can run LLMs on your laptop and even smartphone).
Huggingface provides BLOOM and StarCoder and HuggingChat which are completely open source, without the LLaMA research-only limitation. Databricks trained their own completely open-source Dolly model. Lmsys.org is offering its own Vicuna LLM. Nvidias deep learning research team is developing its Megatron-LM model. The GPT4All initiative is also worth mentioning.
However, all these open-source alternatives are still behind OpenAIs major tech (especially in the alignment perspective) but the gap is rapidly closing.
The easiest way is to use OpenAI public service or their platform API playground, which offers lower-level access to the models and more control over network inner workings (specify system context, tune generation parameters, etc). But you should carefully review their service agreements since they use user interactions for additional model improvements and training. Alternatively, you can choose Microsoft Azure OpenAI services, which provide the same API and tools but with private model instances.
If you are more adventurous, you can try LLM models hosted by HuggingFace, but youll need to be more skilled with Python and data science tooling. Denis Shipilov is experienced Solutions Architect with wide range of expertise from distributed systems design to the BigData and Data Science related projects.
Read more here:
Your Ultimate Guide to Chat GPT and Other Abbreviations - KDnuggets
- How much time do we have before Artificial General Intelligence (AGI) to turns into Artificial Self-preserving - The Times of India - November 5th, 2024 [November 5th, 2024]
- Simuli to Leap Forward in the Trek to Artificial General Intelligence through 2027 Hyperdimensional AI Ecosystem - USA TODAY - November 5th, 2024 [November 5th, 2024]
- Implications of Artificial General Intelligence on National and International Security - Yoshua Bengio - - October 31st, 2024 [October 31st, 2024]
- James Cameron says the reality of artificial general intelligence is 'scarier' than the fiction of it - Business Insider - October 31st, 2024 [October 31st, 2024]
- James Cameron says the reality of artificial general intelligence is 'scarier' than the fiction of it - MSN - October 31st, 2024 [October 31st, 2024]
- Bot fresh hell is this?: Inside the rise of Artificial General Intelligence or AGI - MSN - October 31st, 2024 [October 31st, 2024]
- Artificial General Intelligence (AGI) Market to Reach $26.9 Billion by 2031 As Revealed In New Report - WhaTech - September 26th, 2024 [September 26th, 2024]
- 19 jobs artificial general intelligence (AGI) may replace and 10 jobs it could create - MSN - September 26th, 2024 [September 26th, 2024]
- Paige Appoints New Leadership to Further Drive Innovation, Bring Artificial General Intelligence to Pathology, and Expand Access to AI Applications -... - August 16th, 2024 [August 16th, 2024]
- Artificial General Intelligence, If Attained, Will Be the Greatest Invention of All Time - JD Supra - August 11th, 2024 [August 11th, 2024]
- OpenAI Touts New AI Safety Research. Critics Say Its a Good Step, but Not Enough - WIRED - July 22nd, 2024 [July 22nd, 2024]
- OpenAIs Project Strawberry Said to Be Building AI That Reasons and Does Deep Research - Singularity Hub - July 22nd, 2024 [July 22nd, 2024]
- One of the Best Ways to Invest in AI Is Dont - InvestorPlace - July 22nd, 2024 [July 22nd, 2024]
- OpenAI is plagued by safety concerns - The Verge - July 17th, 2024 [July 17th, 2024]
- OpenAI reportedly nears breakthrough with reasoning AI, reveals progress framework - Ars Technica - July 17th, 2024 [July 17th, 2024]
- ChatGPT maker OpenAI now has a scale to rank its AI - ReadWrite - July 17th, 2024 [July 17th, 2024]
- Heres how OpenAI will determine how powerful its AI systems are - The Verge - July 17th, 2024 [July 17th, 2024]
- OpenAI may be working on AI that can perform research without human help which should go fine - TechRadar - July 17th, 2024 [July 17th, 2024]
- OpenAI has a new scale for measuring how smart their AI models are becoming which is not as comforting as it should be - TechRadar - July 17th, 2024 [July 17th, 2024]
- OpenAI says there are 5 'levels' for AI to reach human intelligence it's already almost at level 2 - Quartz - July 17th, 2024 [July 17th, 2024]
- AIs Bizarro World, were marching towards AGI while carbon emissions soar - Fortune - July 17th, 2024 [July 17th, 2024]
- AI News Today July 15, 2024 - The Dales Report - July 17th, 2024 [July 17th, 2024]
- The Evolution Of Artificial Intelligence: From Basic AI To ASI - Welcome2TheBronx - July 17th, 2024 [July 17th, 2024]
- What Elon Musk and Ilya Sutskever Feared About OpenAI Is Becoming Reality - Observer - July 17th, 2024 [July 17th, 2024]
- Companies are losing faith in AI, and AI is losing money - Android Headlines - July 17th, 2024 [July 17th, 2024]
- AGI isn't here (yet): How to make informed, strategic decisions in the meantime - VentureBeat - June 16th, 2024 [June 16th, 2024]
- Apple's AI Privacy Measures, Elon Musk's Robot Prediction, And More: This Week In Artificial Intelligence - Alphabet ... - Benzinga - June 16th, 2024 [June 16th, 2024]
- AGI and jumping to the New Inference Market S-Curve - CMSWire - June 16th, 2024 [June 16th, 2024]
- Apple's big AI announcements were all about AI 'for the rest of us'Google, Meta, Amazon and, yes, OpenAI should ... - Fortune - June 16th, 2024 [June 16th, 2024]
- Elon Musk Withdraws His Lawsuit Against OpenAI and Sam Altman - The New York Times - June 16th, 2024 [June 16th, 2024]
- Staying Ahead of the AI Train - ATD - June 16th, 2024 [June 16th, 2024]
- OpenAI disbands its AI risk mitigation team - - May 20th, 2024 [May 20th, 2024]
- BEYOND LOCAL: 'Noise' in the machine: Human differences in judgment lead to problems for AI - The Longmont Leader - May 20th, 2024 [May 20th, 2024]
- Machine Learning Researcher Links OpenAI to Drug-Fueled Sex Parties - Futurism - May 20th, 2024 [May 20th, 2024]
- What Is AI? How Artificial Intelligence Works (2024) - Shopify - May 20th, 2024 [May 20th, 2024]
- Vitalik Buterin says OpenAI's GPT-4 has passed the Turing test - Cointelegraph - May 20th, 2024 [May 20th, 2024]
- "I lost trust": Why the OpenAI team in charge of safeguarding humanity imploded - Vox.com - May 18th, 2024 [May 18th, 2024]
- 63% of surveyed Americans want government legislation to prevent super intelligent AI from ever being achieved - PC Gamer - May 18th, 2024 [May 18th, 2024]
- Top OpenAI researcher resigns, saying company prioritized 'shiny products' over AI safety - Fortune - May 18th, 2024 [May 18th, 2024]
- The revolution in artificial intelligence and artificial general intelligence - Washington Times - May 18th, 2024 [May 18th, 2024]
- OpenAI disbands team devoted to artificial intelligence risks - Yahoo! Voices - May 18th, 2024 [May 18th, 2024]
- OpenAI disbands safety team focused on risk of artificial intelligence causing 'human extinction' - New York Post - May 18th, 2024 [May 18th, 2024]
- OpenAI disbands team devoted to artificial intelligence risks - Port Lavaca Wave - May 18th, 2024 [May 18th, 2024]
- OpenAI disbands team devoted to artificial intelligence risks - Moore County News Press - May 18th, 2024 [May 18th, 2024]
- Generative AI Is Totally Shameless. I Want to Be It - WIRED - May 18th, 2024 [May 18th, 2024]
- OpenAI researcher resigns, claiming safety has taken a backseat to shiny products - The Verge - May 18th, 2024 [May 18th, 2024]
- Most of Surveyed Americans Do Not Want Super Intelligent AI - 80.lv - May 18th, 2024 [May 18th, 2024]
- A former OpenAI leader says safety has 'taken a backseat to shiny products' at the AI company - Winnipeg Free Press - May 18th, 2024 [May 18th, 2024]
- DeepMind CEO says Google to spend more than $100B on AGI despite hype - Cointelegraph - April 20th, 2024 [April 20th, 2024]
- Congressional panel outlines five guardrails for AI use in House - FedScoop - April 20th, 2024 [April 20th, 2024]
- The Potential and Perils of Advanced Artificial General Intelligence - elblog.pl - April 20th, 2024 [April 20th, 2024]
- DeepMind Head: Google AI Spending Could Exceed $100 Billion - PYMNTS.com - April 20th, 2024 [April 20th, 2024]
- Say hi to Tong Tong, world's first AGI child-image figure - ecns - April 20th, 2024 [April 20th, 2024]
- Silicon Scholars: AI and The Muslim Ummah - IslamiCity - April 20th, 2024 [April 20th, 2024]
- AI stocks aren't like the dot-com bubble. Here's why - Quartz - April 20th, 2024 [April 20th, 2024]
- AI vs. AGI: The Race for Performance, Battling the Cost? for NASDAQ:GOOG by Moshkelgosha - TradingView - April 20th, 2024 [April 20th, 2024]
- We've Been Here Before: AI Promised Humanlike Machines In 1958 - The Good Men Project - April 20th, 2024 [April 20th, 2024]
- Google will spend more than $100 billion on AI, exec says - Quartz - April 20th, 2024 [April 20th, 2024]
- Tech companies want to build artificial general intelligence. But who decides when AGI is attained? - The Bakersfield Californian - April 8th, 2024 [April 8th, 2024]
- Tech companies want to build artificial general intelligence. But who decides when AGI is attained? - The Caledonian-Record - April 8th, 2024 [April 8th, 2024]
- What is AGI and how is it different from AI? - ReadWrite - April 8th, 2024 [April 8th, 2024]
- Artificial intelligence in healthcare: defining the most common terms - HealthITAnalytics.com - April 8th, 2024 [April 8th, 2024]
- We're Focusing on the Wrong Kind of AI Apocalypse - TIME - April 8th, 2024 [April 8th, 2024]
- Xi Jinping's vision in supporting the artificial intelligence at home and abroad - Modern Diplomacy - April 8th, 2024 [April 8th, 2024]
- As 'The Matrix' turns 25, the chilling artificial intelligence (AI) projection at its core isn't as outlandish as it once seemed - TechRadar - April 8th, 2024 [April 8th, 2024]
- AI & robotics briefing: Why superintelligent AI won't sneak up on us - Nature.com - January 10th, 2024 [January 10th, 2024]
- Get Ready for the Great AI Disappointment - WIRED - January 10th, 2024 [January 10th, 2024]
- Part 3 Capitalism in the Age of Artificial General Intelligence (AGI) - Medium - January 10th, 2024 [January 10th, 2024]
- Artificial General Intelligence (AGI): what it is and why its discovery can change the world - Medium - January 10th, 2024 [January 10th, 2024]
- Exploring the Path to Artificial General Intelligence - Medriva - January 10th, 2024 [January 10th, 2024]
- The Acceleration Towards Artificial General Intelligence (AGI) and Its Implications - Medriva - January 10th, 2024 [January 10th, 2024]
- OpenAI Warns: "AGI Is Coming" - Do we have a reason to worry? - Medium - January 10th, 2024 [January 10th, 2024]
- The fight over ethics intensifies as artificial intelligence quickly changes the world - 9 & 10 News - January 10th, 2024 [January 10th, 2024]
- AI as the Third Window into Humanity: Understanding Human Behavior and Emotions - Medriva - January 10th, 2024 [January 10th, 2024]
- Artificial General Intelligence (AGI) in Radiation Oncology: Transformative Technology - Medriva - January 10th, 2024 [January 10th, 2024]
- Exploring the Potential of AGI: Opportunities and Challenges - Medium - January 10th, 2024 [January 10th, 2024]
- Full-Spectrum Cognitive Development Incorporating AI for Evolution and Collective Intelligence - Medriva - January 10th, 2024 [January 10th, 2024]
- Artificial Superintelligence - Understanding a Future Tech that Will Change the World! - MobileAppDaily - January 10th, 2024 [January 10th, 2024]
- Title: AI Unveiled: Exploring the Realm of Artificial Intelligence - Medium - January 10th, 2024 [January 10th, 2024]
- The Simple Reason Why AGI (Artificial General Intelligence) Is Not ... - Medium - December 2nd, 2023 [December 2nd, 2023]