
GPT-4 vs GPT-3: An Incredible Logic & Reasoning Upgrade

GPT-4 is here... and it's amazing. The sheer attention to detail is something we've never seen before. We'll go over some details, what's new, and comparisons between GPT-4 and GPT-3.5. Also, as AI gets better and better, what ethical issues does this raise for the general public?
Updated March 15, 2023
two robots battling in a boxing area, one is blue and one is red, photorealistic 4k picture taken from outside and slightly above the ring

What if we could push the boundaries of an already revolutionary artificial intelligence model? I mean, just a few months ago nobody had ChatGPT in their lives. Today, its users would be crushed if it were taken away from them. It saves hundreds of hours, streamlines workflows, and sparks countless moments of inspiration. But as with all things groundbreaking, the quest for improvement never ends. Humans are greedy! We always want more.

Enter GPT-4, the successor to the much-celebrated GPT-3. In the ever-evolving landscape of AI, GPT-4 takes text generation, language understanding, and problem-solving to a whole new level. While it continues to face some limitations, this dynamic powerhouse is already making its mark on countless industries and applications.

Over the next few minutes, we'll explore the captivating world of GPT-4, how it works with ChatGPT, and how it's revolutionizing generative AI (even more than last week).

What is GPT-4?

GPT-4 is a game-changing model that's leveling up the world of generative AI. This bad boy is a large multimodal model, taking in both image and text inputs, and it's absolutely mind-blowing with its human-level performance on various professional and academic benchmarks.

Think about it like this: GPT-4 passed a simulated bar exam with a score around the top 10% of test takers, while GPT-3.5 was chilling around the bottom 10%. And I thought GPT-3.5 was incredible...

OpenAI spent half a year improving GPT-4 based on lessons learned from adversarial testing and ChatGPT, so it's safe to say we've got some next-level stuff happening here.

In a casual chat, the difference between GPT-3.5 and GPT-4 might not be super obvious, but once you push the complexity of the task, GPT-4 shines. It's more reliable, creative, and can handle way more nuanced instructions than its predecessor.

To really grasp the difference, OpenAI tested these models on a variety of benchmarks, including exams that were originally designed for humans! They used the most recent publicly-available tests and practice exams. And guess what? They didn't do any specific training for these exams. Crazy, right?

It's clear that GPT-4 has better reasoning skills than GPT-3.5. Both models performed very well on many exams, but there's a clear skill upgrade on exams that require more complex logic and reasoning.

GPT-4's text input capability is currently available on ChatGPT if you have the premium version. A larger rollout to the general public is estimated to happen within a few weeks.

GPT-4 vs GPT-3.5

Interestingly, OpenAI has been pretty tight-lipped about the specifics of GPT-4's size and the exact numbers behind why it outshines its predecessor. They still showcased the improvements from a test score standpoint, but shared nothing about the parameters behind the new model.

OpenAI's chief scientist, Ilya Sutskever, and other members of the GPT-4 team remained cautious about revealing too much. Sutskever mentioned, "It's pretty competitive out there," and that they couldn't really comment on the details at this time.

This secrecy could be attributed to the highly competitive nature of the AI industry, where every edge in technology might make a significant difference. OpenAI might be safeguarding its advancements to maintain a competitive advantage, which is not uncommon in a fast-paced field like tech. As we continue to explore the differences between GPT-3.5 and GPT-4, we can expect more revelations to emerge over time, shedding light on the reasons behind GPT-4's superior performance.

I wanted to showcase some stark differences I noticed when asking ChatGPT the same question across different models. I tested reasoning, logic, and conversational skills. Here are some of the results:

GPT-4 Complex Vocabulary & Syntax Examples

Prompt: Describe in the most explicit detail what a slice of pizza is like. Describe everything in engaging, creative, and savory words to make the user want to eat a slice. Do this in a paragraph.

I've highlighted some of the best descriptions and syntactical word choices in bold. You can see GPT-4 is levels ahead at enticing its readers into the subject.

GPT-3.5 turbo (ChatGPT):
A slice of pizza is a culinary masterpiece, a perfect marriage of dough, sauce, cheese, and toppings, each ingredient harmonizing to create a symphony of flavor. The dough is soft and chewy, with just the right amount of resistance to the bite, and a tantalizing aroma that hits your nostrils as soon as the slice is lifted from the box. The sauce is rich and tangy, bursting with the flavors of tomatoes, garlic, and herbs, and perfectly balanced with just the right amount of sweetness. And then, there's the cheese, gooey and melted to perfection, stretching out in long strands as you take your first bite. The toppings add another layer of excitement, whether it's spicy pepperoni, salty olives, or fresh mushrooms, each bite a delightful surprise. With each chew, the flavors combine, creating a symphony of taste that dances across your tongue. It's not just a slice of pizza, it's a work of art, a culinary masterpiece that demands to be savored and enjoyed. So go ahead, take a bite, and let the flavors transport you to pizza heaven.

GPT-4 (ChatGPT):
Imagine a warm, mouthwatering slice of pizza, where the golden-brown crust is a symphony of crispiness and chewiness. The intoxicating aroma of freshly baked dough and melted cheese wafts through the air, luring your senses towards this heavenly delight. The velvety, rich tomato sauce is a harmonious blend of ripe tomatoes, garlic, and fragrant herbs, creating an impeccable balance between tangy and sweet. Blanketing the sauce is a generous layer of silky, gooey mozzarella cheese, bubbling and browning to perfection, stretching tantalizingly with every bite. A vibrant medley of toppings adorns the slice, from succulent, tender pepperoni to the earthy, umami-rich flavor of sautéed mushrooms, all melding together into a culinary masterpiece.
Each bite of this tantalizing slice is a testament to the art of pizza making, transporting you to a realm of gastronomic ecstasy, making it nearly impossible to resist the temptation of savoring every last crumb.

Prompt: Generate me a description for YouTube as if I had to sell a subscription like my life depended on it. Make it no longer than a paragraph pitch. Be extremely detailed & creative.

In this case, I think GPT-3.5 actually wins. It seems like GPT-4-based ChatGPT is a lot more literal. It takes exactly what you give it and sticks to that command. This removes a lot of the hallucinations that we've seen in the past with generative AI, but I still prefer GPT-3.5's answer for this specific scenario.

GPT-3.5 turbo (ChatGPT):
YouTube is not just a video-sharing platform, it is a gateway to endless entertainment, education, and inspiration. With over 2 billion monthly active users, YouTube has become a global phenomenon, offering a diverse array of content that caters to every interest and passion. From cooking tutorials to makeup reviews, from travel vlogs to gaming streams, from music videos to comedy skits, there is something for everyone on YouTube. And with a subscription, you can unlock exclusive features like ad-free viewing, offline playback, and access to YouTube Originals, a series of high-quality content produced by YouTube itself. Imagine never having to sit through a commercial again, being able to watch your favorite videos on a long flight, or being the first to watch a new series starring your favorite creators. A subscription to YouTube is not just an investment in entertainment, it is an investment in your own personal growth, your own creativity, and your own happiness. Don't miss out on the opportunity to join the YouTube community and discover a world of endless possibilities.

GPT-4 (ChatGPT):
Discover a world of limitless entertainment and knowledge with YouTube Premium! Immerse yourself in ad-free, high-quality content, curated just for you. Explore a galaxy of captivating videos, from heart-pounding action movies to inspirational documentaries, laugh-out-loud comedy, and everything in between. Unlock the door to exclusive, cutting-edge YouTube Originals, featuring your favorite creators and celebrities. With background play and offline downloads, your adventures never stop, even when you're on the go. Plus, enjoy the sweet symphony of uninterrupted music with YouTube Music Premium, transforming your listening experience. Don't miss out on this life-changing opportunity to elevate your YouTube journey - subscribe to YouTube Premium now, and unlock a treasure trove of incredible content at your fingertips!

GPT-4 Concept Explanation & Reasoning

Prompt: Write me a 1 or 2 paragraph explanation about how SMTP/POP email servers work, but explain it like you would be talking to a group of 8th graders and include an analogy.

GPT-3.5 turbo (ChatGPT):
Email servers work like a post office for digital letters. SMTP (Simple Mail Transfer Protocol) is like the person who sends the letter, while POP (Post Office Protocol) is like the post office that receives and stores the letter until the recipient comes to pick it up.

When someone sends an email, their SMTP server sends the message to the recipient's POP server. The POP server then stores the message until the recipient's email program asks for it, just like the post office stores your physical mail until you come to pick it up. Once the recipient's email program requests the message, the POP server sends it to the recipient's computer or mobile device.

In summary, SMTP is like the sender of a digital letter, while POP is like the post office that stores and delivers the letter to the recipient when requested.

GPT-4 (ChatGPT):
Alright, 8th graders, let's talk about how email works! You can think of email as a way of sending and receiving digital letters. Just like the post office handles sending and delivering physical letters, there are special computer systems called email servers that help us send and receive emails. Two important types of email servers are SMTP (Simple Mail Transfer Protocol) and POP (Post Office Protocol).

Imagine the SMTP server as a mail carrier who picks up your letters (emails) and delivers them to the right person. When you send an email, your computer connects to the SMTP server, which then sends your email to the recipient's email server. Now, the POP server is like your personal mailbox. It's where all your incoming mail gets stored until you're ready to read it. When you open your email app, it connects to the POP server to download your new emails so you can read them. So, in short, SMTP servers help deliver your emails, while POP servers store and manage the emails you receive, just like a mail carrier and your mailbox at home.
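
Stepping outside the two answers for a moment: the mail carrier and mailbox analogy can also be sketched as a tiny Python toy. This is purely illustrative and does not speak real SMTP or POP. The class and method names (ToyMailServer, smtp_deliver, pop_fetch) and the example address are made up for this sketch, and emptying the mailbox on fetch is a simplification of how a mail app downloads new messages.

```python
from collections import defaultdict
from typing import Dict, List


class ToyMailServer:
    """Toy stand-in for a mail server; it does not implement real SMTP or POP."""

    def __init__(self) -> None:
        # One mailbox (a list of messages) per recipient address.
        self.mailboxes: Dict[str, List[str]] = defaultdict(list)

    def smtp_deliver(self, recipient: str, message: str) -> None:
        """The 'mail carrier' step: drop a message into the recipient's mailbox."""
        self.mailboxes[recipient].append(message)

    def pop_fetch(self, recipient: str) -> List[str]:
        """The 'mailbox' step: hand over stored messages and empty the mailbox."""
        messages = self.mailboxes[recipient]
        self.mailboxes[recipient] = []
        return messages


server = ToyMailServer()
server.smtp_deliver("sam@example.com", "Pizza party on Friday!")
server.smtp_deliver("sam@example.com", "Bring napkins.")

inbox = server.pop_fetch("sam@example.com")        # both stored messages
second_fetch = server.pop_fetch("sam@example.com")  # nothing new since last fetch
print(inbox)
print(second_fetch)
```

The point of the toy is the split both answers describe: one step moves mail toward the recipient, and a separate step lets the recipient's app pick it up later.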

Prompt: Can you explain both pros and cons of pineapple on pizza? Give reasoning for both sides, and refute the opposition position. The entire pro argument and refutation should be a paragraph, and the con argument and refutation should also only be a single paragraph.

GPT-3.5-based ChatGPT didn't even listen to my instructions! It didn't give me exactly what I asked for: its con paragraph never refutes the pro side. It was also answering at a very elementary level compared to GPT-4-based ChatGPT.

GPT-3.5 turbo (ChatGPT):
Pro: The addition of pineapple on pizza can bring a refreshing and sweet flavor to the classic savory pizza. The juicy and sweet pineapple provides a contrasting flavor to the salty and savory toppings, creating a unique and delicious taste. It can also provide a healthy option for those looking to add more fruit to their diet. Those who argue against pineapple on pizza may be resistant to change and experimentation in the culinary world. However, it is important to remember that taste is subjective and what may not suit one person's palate may be enjoyed by another.

Con: Pineapple on pizza can be a controversial topic for many, with some arguing that it disrupts the classic taste of a pizza. The sweetness of the pineapple can clash with the savory toppings, and the texture can be off-putting for some. The addition of pineapple on pizza may also be seen as a fad or trend that will eventually fade away.

GPT-4 (ChatGPT):
Pros of pineapple on pizza and refutation of cons:
Pineapple on pizza offers a unique flavor combination, balancing sweetness with savory ingredients, and provides some health benefits due to its enzymes and nutrients. The argument against pineapple on pizza is often rooted in traditionalism and the clash with Italian flavors. However, this perspective can limit culinary exploration and overlooks the opportunity to cater to diverse taste preferences, making pizza a more versatile and inclusive dish.

Cons of pineapple on pizza and refutation of pros:
Critics argue that pineapple's sweetness disrupts the traditional harmony of pizza flavors and can lead to a soggy crust. Additionally, pineapple may not provide the same level of protein or satiety as other toppings. However, taste preferences are subjective, and the inclusion of pineapple can be appreciated for its distinct taste, blending sweet and savory elements. Furthermore, while pizza is not generally considered a health food, adding a small amount of pineapple can still offer some health benefits and contribute to culinary diversity.
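
If you want to rerun this kind of side-by-side test yourself, here is a minimal Python sketch of the idea: send one identical prompt to each model and print the replies under each other. It is not how the results above were produced (those came from the ChatGPT interface). The function names (compare_models, format_comparison, fake_ask) are hypothetical, the model labels are just strings, and fake_ask is a stand-in you would replace with a real chat call of your own.

```python
from typing import Callable, Dict, List

# Assumed shape of a chat call: takes (model_label, prompt), returns the reply text.
AskFn = Callable[[str, str], str]


def compare_models(ask: AskFn, models: List[str], prompt: str) -> Dict[str, str]:
    """Send the same prompt to every model and collect the replies by model label."""
    return {model: ask(model, prompt) for model in models}


def format_comparison(prompt: str, replies: Dict[str, str]) -> str:
    """Lay out the prompt and each reply, with a rough word count per reply."""
    lines = [f"Prompt: {prompt}", ""]
    for model, reply in replies.items():
        lines.append(f"{model}:")
        lines.append(reply)
        lines.append(f"({len(reply.split())} words)")
        lines.append("")
    return "\n".join(lines).rstrip()


def fake_ask(model: str, prompt: str) -> str:
    """Offline placeholder so the sketch runs without any API access."""
    return f"[{model}] reply to: {prompt}"


if __name__ == "__main__":
    prompt = "Describe a slice of pizza."
    replies = compare_models(fake_ask, ["gpt-3.5-turbo", "gpt-4"], prompt)
    print(format_comparison(prompt, replies))
```

Keeping the prompt identical across models is the whole trick; any difference in the output then comes from the model and not from how the question was asked.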

Final Thoughts

GPT-4 has truly raised the bar in the world of artificial intelligence. While we don't know all the specifics about what makes it so much better than its predecessor, its remarkable performance across a wide range of tasks and benchmarks is undeniable. As we move forward, GPT-4 will continue to make waves in various industries and spark new advancements.

It's important to appreciate the progress GPT-4 represents in the AI field. This isn't just a simple upgrade – it's a meaningful step forward that showcases the ever-increasing potential of AI. So, as we embrace this new era, let's remain excited and curious about the possibilities that GPT-4 will unlock. There's no doubt that with GPT-4 leading the way, the future of artificial intelligence is brighter than ever.

Certainly, the release of GPT-4 also brings with it ethical and moral concerns that must be considered. As AI becomes more advanced and capable, there is a growing risk of misuse or abuse, which could lead to the spread of misinformation, the creation of deepfakes, or the amplification of harmful content. Additionally, concerns about surveillance and privacy may arise as AI systems become more adept at understanding human behavior and language.

Addressing these issues requires not only the vigilance of AI developers but also the engagement of users, policymakers, and society at large. OpenAI, for its part, has acknowledged the importance of addressing these concerns and has shown commitment to researching and implementing safety measures to mitigate potential risks.

Sam Altman himself even thinks we need more regulation on AI. It's a bit concerning to see the CEO of one of the leading tech companies in the world openly state that what they are making should be regulated in some way. For now, it's up to the ethical board behind OpenAI to ensure shipped products are ethically sound for the audience they intend to target.

It is crucial that we continue to have open conversations about the ethical implications of AI advancements like GPT-4, so that we can harness its potential while navigating the challenges it presents responsibly and ethically.

Despite the ethical and moral concerns that accompany the release of GPT-4, there is no denying that this technology represents a remarkable advancement in the field of artificial intelligence. The potential applications of GPT-4 are vast and varied, and it will be exciting to witness the innovative ways in which people leverage this powerful tool to solve problems, streamline workflows, and spark creativity across various industries. What an upgrade. What a time to be involved in tech!

Have you tried GPT-4? What are your thoughts?

Want To Learn Even More?
If you enjoyed this article, subscribe to our free monthly newsletter
where we share tips & tricks on how to use tech & AI to grow and optimize your business, career, and life.
Written by Justin Gluska
Justin is the founder of Gold Penguin, a business technology blog that helps people start, grow, and scale their business using AI. The world is changing and he believes it's best to make use of the new technology that is starting to change the world. If it can help you make more money or save you time, he'll write about it!