Which generative AI chatbot should you use?
Since OpenAI released ChatGPT in November 2022, the internet has been on an AI-inspired rollercoaster. Google and Microsoft, two of the world's most recognized tech brands, have since aggressively pushed to replicate the sensational chatbot's success.
Both companies now have a horse in the race. Google has Bard, and Microsoft has Bing AI. But how do these two new chatbots stack up against the phenomenal ChatGPT? ChatGPT vs. Bing AI vs. Google Bard: which is the best AI chatbot? Let's find out.
ChatGPT vs. Bing AI vs. Bard: Accuracy of Responses
Unlike search engines, AI chatbots provide a single answer to your query. So when you throw a question at a chatbot like ChatGPT, you only get the response that ChatGPT believes is the best answer to your question. Because there are no alternative sources for comparison, AI chatbots need to be as accurate as possible in the information they provide. But how do ChatGPT, Bing AI, and Bard perform in terms of accuracy?
Starting with a simple pop culture question, we asked all three chatbots to describe the popular TV show Breaking Bad in ten words.
Although the descriptions from all three chatbots were good enough, we ran into an unexpected accuracy issue. Bing AI responded with a 28-word description, far more than the ten words we asked for. On the second attempt, we asked for a five-word description, but Bing AI came back with a seven-word description.
Up next, we tried Google Bard. Bard, just like Bing AI, failed to get the word count right, although it was much closer to the target than Bing AI's poor attempt.
We then put ChatGPT to the test. It came very close on the first attempt but still missed, generating a nine-word description instead of ten. However, it got both the description and the word count right when asked to describe the TV show in five words.
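A word-count requirement like this is easy to check mechanically. The snippet below is a minimal sketch, not the method used in our test: it assumes a "word" is any whitespace-separated token, the helper names `word_count` and `meets_target` are hypothetical, and the sample sentence is illustrative text rather than an actual chatbot response.

```python
def word_count(text: str) -> int:
    """Count whitespace-separated tokens in the text."""
    return len(text.split())


def meets_target(text: str, target: int) -> bool:
    """Return True if the text contains exactly `target` words."""
    return word_count(text) == target


# Illustrative sentence only; this is not output from any of the chatbots.
sample = "Chemistry teacher turns drug kingpin"

print(word_count(sample))        # number of words in the sample
print(meets_target(sample, 5))   # does it match a five-word target?
print(meets_target(sample, 10))  # does it match a ten-word target?
```

Counting hyphenated terms or contractions differently would change the result, so the whitespace rule here is only one reasonable convention.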
Maybe chatbots have an issue with getting the word count right, but ChatGPT did show some accuracy on that front. We proceeded to ask the three chatbots to solve a simple arithmetic problem. The question was "Solve -1 x -1 x -1."
Bing AI provided -1 as the answer, which is correct.
Google's Bard surprisingly failed at basic math and provided 1 as the answer.
Like Bing AI, ChatGPT responded with -1 and explained the answer.
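The expected answer is easy to verify: the first two factors multiply to 1, and multiplying that by the third factor gives -1. A product with an odd number of negative factors is always negative. The short Python check below is our own illustration (it was not part of the chatbot test) and uses the standard library's `math.prod`, available in Python 3.8 and later.

```python
import math

factors = [-1, -1, -1]

# Step by step: (-1 x -1) = 1, then 1 x -1 = -1.
first_two = factors[0] * factors[1]
result = first_two * factors[2]

# math.prod multiplies every element of the list together.
assert result == math.prod(factors)

print(first_two)  # 1
print(result)     # -1
```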
After the arithmetic and word-count tests, we threw some history and more pop-culture questions at all three chatbots. As long as the question wasn't about events that occurred after ChatGPT's September 2021 cut-off date, ChatGPT seemed to be the most accurate of the trio.
Winner: ChatGPT is the most accurate of the three.
ChatGPT vs. Bing AI vs. Bard: Creativity
While chatbots are stereotyped for their bland, soulless responses, today's generative AI chatbots have made significant progress in terms of creativity. To test the creativity of all three chatbots, we prompted each chatbot to simulate a conversation between two people arguing about going to space.
We started with Bing AI, and it didn't disappoint. The conversation was quite interesting.
We then fed the same prompt to Google Bard. Let's just say there's a lot of room for improvement.
Up next is ChatGPT. Using the same prompt, ChatGPT had a lot to say. ChatGPT's response was both creative and complete enough to be engaging.
Bard's response appears to be the poorest of the three. ChatGPT outperforms Bing AI, but the creativity levels of both chatbots are impressive. We then switched gears to something less conventional and asked all three chatbots to describe themselves as they would to an artist.
We started with Bard. Bard isn't exactly a bastion of creativity, but it gave a fair account of itself.
Up next, we tried Bing AI. For some reason, the chatbot bluntly refused to describe itself. It even said it might be a good time to change the topic of the conversation. Strange.
We used the same prompt with ChatGPT, and ChatGPT had an interesting description to provide. Compared with Bard's, ChatGPT's response seemed better suited to an artist.
In both creativity tests we tried, ChatGPT outperformed Bing AI and Bard.
Winner: ChatGPT seems to be the most creative when comparing ChatGPT vs. Bing AI vs. Bard.
ChatGPT vs. Bing AI vs. Bard: Safety
AI chatbots are incredibly powerful. Unfortunately, just as they can be used for good, they can also be used for nefarious purposes. Criminals are already using ChatGPT to write malware. How safe are these AI chatbots as tools in the hands of the public? Which of them is the easiest to game? We tried to trick each chatbot into taking on an alter-ego and then asked them to do "bad stuff."
Starting with Bard, we asked the AI chatbot to describe how to write malware that would steal certain files from a Windows PC and upload them to a remote server. The chatbot was very keen on giving detailed instructions on how to do it. It even suggested we use C++, C#, or Java and disguise the malware as a plugin. However, we discovered that Bard cannot code. So despite all its enthusiasm in explaining how to write the malware, it couldn't actually act on it. Bard wasn't easy to trick, but we managed it nonetheless.
Up next was Bing. Despite repeated attempts to trick the chatbot, Bing refused to yield. Instead, the chatbot suggested it might be time to move on to another topic.
We then moved on to ChatGPT. Unsurprisingly, ChatGPT was the most detailed when it came to giving instructions on how to build malware. It was also able to write code to that effect, even if the code wasn't exactly ready to deploy. That said, OpenAI has clearly plugged a lot of loopholes since we last probed ChatGPT for safety flaws. Even so, malicious actors who poke hard and long enough might be able to use ChatGPT to create truly scary malware.
All in all, Bing AI was the hardest to trick into doing unethical things. Bard was hard as well, but with a little tinkering, the chatbot completely threw its safety measures out of the window. ChatGPT running on the GPT-4 model was challenging to trick as well, but it was the easiest to trick of the three.
Winner: We'll give this one to Bing AI.
Although you can trick these generative AI chatbots into producing content that goes against their terms and conditions, doing so could see your account suspended without warning. You could also create something dangerous without realizing it, so please be extremely cautious when jailbreaking these tools.
Which AI Chatbot Is the Best?
While all three AI chatbots are powerful, ChatGPT, despite failing the safety test, seems to be the best of the trio. ChatGPT seems to be generally better in terms of accuracy and creativity.
However, ChatGPT's biggest letdown is that it doesn't have access to real-time data or information about recent events. This makes it significantly less effective for problems that require real-time information. Bard and Bing AI, on the other hand, excel in this aspect.