I am not exaggerating when I say that the publication of
ChatGPT might mark the beginning of a
small revolution of education and the working
life, and I think this is as close as we have come to artificial intelligence depicted in movies (whether this is a
positive or negative thing is up to you).
But lets clarify first, what it is and how it works. ChatGPT is an offshoot or a specification of the GPT-3 language
model that specializes in conducting realistic dialogues! GPT stands for
Generative
Pre-trained
Transformer and GPT-3 is the third generation of
this architecture, who would have thought? It is an impressive autoregressive natural language processing (NLP)
machine learning model developed by the company OpenAI.
The model was trained on an enormous dataset of human-generated text, gathered from the internet with commoncrawl or
extracted from books.
Commoncrawl is an open source organization that stores internet archives and gathers text from platforms such as Wikipedia,
Reddit or Twitter and makes it available for the public. Huge datasets that are used to extract information from it
are also known as Big-Data - you might already have heard this term - and this use-case is a very good example
for that.
ChatGPT is a huge model (800GB) that uses a so-called transformer architecture, which basically means that it aims to learn
understanding and generating realistic sounding answers to a prompt / question by "reading" a vast amount of text from
which words or even whole sentences were masked-out. Syntax and semantics are learned by predicting the masked out
words or sentences and to either "reward or punish" the model based on the quality of their prediction. It tried to visualize this
concept down below. In step 1 the model predicts the term "tree" for the masked out word so the prediction is "Hello my tree is Adrian".
I think we can agree that this does not make much sense so we tell the model what we want to see instead, namely the word "name". Having this information
the model updates itself with a magical process called backpropagation (details are ignored for now) so that in step 2 it has learned that the desired word
is "name". We reward the model for providing the correct prediction.
This is of course as boiled down explanation of what is actually going on, but it should be enough to get a general understanding of how a machine learning
model is learning.
ChatGPT uses an even more advanced method called Reinforced Learning from Human Feedback (RFHF) for training, in which
linguists evaluate the quality of the model's responses and provide feedback to improve its performance. As a result, ChatGPT can
conduct dialogues in various languages effectively and make references to previous answers and questions. However,
this model is
not sentient! In the end it's just
a statistical model that gives you the most probable text answer to a provided question or phrase! Despite its
obvious benefits, this tool of course yields some problems. The model was trained in 2021 and is not connected to
the internet, thus it has no information
about current events and news. But as far as I can assess, the connection to the internet will just be a matter of
time. Sometimes the answers are incorrect because there is no mechanism (that I am aware of) to correct them. The
system simply outputs the most likely answer, even if it is not accurate.
But this is the nature of probability - even if you're 99% sure, there is still a 1% chance of being wrong! It even
might provide a sexist or racist answer to a question, because of the training data that might be biased.
I have even seen ChatGPT write an essay about why drug abuse sounds like a good idea, but this is not possible
anymore. And this is the thing. I would be surprised if people at OpenAI are not aware of these mentioned problems
so I guess many of these flaws will be taken care of in the next generations of the GPT models (maybe already for
the upcoming GPT-4 model). And also, most of the time the problem is not the computer itself but the human in front
of it. Fact-checking is always a good idea.
There are also other variations and use cases of the GPT-3 model.
Explainpaper for example let's you upload a pdf file and provides very
precise and easy explanations of highlighted text.
Rtutor.ai lets you generate R code to plot data that you want to
analyze.
If you haven't tried ChatGPT or any form of language model yet, I highly recommend getting familiar with these
tools. I can imagine that they can be very helpful for any aspect of your life!
I even let it write a blog post about itself. It's impressive
however it made itself even better than it actually is. Let's see if you find what's wrong!
Below you can find some inputs for ChatGPT that I find very interesting if you need some inspiration:
- What is the best chess opening?
- Can you make this sound more professional: Paste any text afterwards.
- Can you create an introduction email for me containing all important information:
Name: Mike
Department: Human Recourses
Hobbies: Football, Playing Board Games, Hiking
And please include a quote of Albert Einstein!
After this is done you can ask:
Can you embed this mail into html and style it nicely with css. An animation with JavaScript would also be
nice!
And this is the result (The complete file was written and
designed by this model)
- Create the game snake in python!
- Give me a nice tikka masala recipe with tofu!
- Write me a detailed assay about Cambridge analytica!
- Write a christmas message to my grandma!
- Fill in the parentheses: The meaning of life is [].
- Write a conversation between Bill Gates, Linus Torvalds and Tim Cook!
- Write me a nice and easy to read blog post about ChatGPT!