Google Gemini and OpenAI’s ChatGPT that uses the GPT-4 model are two of the most advanced artificial intelligence (AI) solutions available today. They can comprehend and interact with text, images, video, audio, and code, as well as output various alterations of each. they also provide expertise that would cost a lot to replicate with an expert human.

But if you’re weighing which tool to put your time and energies into learning how to use, you want to pick the best one. Which is the more capable AI tool? Gemini or GPT-4?

Both GPT-4 and Google Gemini are available now in one form or another. Gemini just needs a Google account to use it. GPT-4 is only readily available in the ChatGPT tool with a GPT-Plus subscription, which will cost you $20 a month.

You can use GPT 3.5 and GPT-4 through Microsoft Copilot, and there are some tools that let you use GPT-4 for free in a limited capacity, but the main way to use it is through ChatGPT.

It’s also important to note that while Gemini Pro is an effective form of the AI, it isn’t the final form. Google has promised that Gemini Ultra will be released later and will be the most capable of its AI solutions. It has also launched Gemini Nano, which runs locally on some smartphones and has a reduced feature set — it’s primarily used to power text summary and proofreading tools.

The main feature sets of Gemini and GPT-4 are very similar. Both can interact with and interpret text, images, videos, audio, and code, allowing you to use them for a wide range of tasks. You can have general conversations or ask questions of both chatbots, have them look at a picture of your cupboard ingredients and suggest a recipe based on it, estimate how many people are in a video of a crowd, or make suggestions for how to streamline your code to make it more efficient.

Both AI can be augmented with additional extensions, though at the time of writing, Google’s are more limited. Gemini can utilize Google Flights, Hotels, Maps, YouTube, and Google’s range of Workspace applications, letting you interact with documents, find live data on flights and hotels, and more.

However, OpenAI’s GPT-4 has a much greater array of plug-ins and extensions, most produced by third parties. While some of those capabilities are being rolled into GPT-4 as it is updated, there are lots of third-party extensions that can help you with shopping, advanced mathematical calculations, integration with thousands of other apps, and booking restaurant tables. There are even plug-ins to help you write better prompts.

In terms of customization, GPT-4 holds an edge on Gemini here as well. It lets you create custom versions of GPT-4 so that you can focus it on specific tasks. This can save you repeating yourself when you come back to the chat later. Gemini lets you launch new chats, but there are no customization options at this time.

GPT-4 can also create images on the fly. While Gemini is designed to do that, it’s not something it’s capable of just yet. GPT-4 has Dall-E 3 built into it, so if you’re already paying for GPT-Plus, there’s no need to also pay for an image-generation subscription.

Response time is clearly faster with Gemini, however. While GPT-4 has lulls where the sheer number of users can cause GPT-4 responses to slow or even be interrupted entirely, making GPT-4 unusable for short periods of time, Gemini responds incredibly quickly.

Both tools give you the option to fact-check them in some manner. GPT-4 provides source links for claims it makes at the end of its responses, while Gemini has a button that lets you perform a Google search for the information you’re looking for to confirm it yourself.

When Google launched Gemini, it highlighted how its Pro model could beat GPT-4 in a range of AI-based applications and benchmarks, including in reasoning capability. However, testing this in the real world is far more difficult, with both AIs displaying impressive responses to the same prompts, whether it’s for text, images, video, or coding.

I’ve personally found both AIs useful when asking questions about game coding, and both function well as conversational chatbots. Gemini seems to be programmed to steer clear of discussion about or interactions with images of certain people, however. It refused to identify the clothing of a celebrity I showed it, and when asked about yours truly, it couldn’t tell me much. GPT-4 had no such problems.

While I was given the option to use Google to double-check other responses, Gemini didn’t use Google to find information about me, as GPT-4 did with Bing, nor did it give me the option to fact-check it.

With all that in mind, at the time of writing, GPT-4 feels like it holds an edge. It’s a more complete tool, with greater capabilities through plug-ins and custom chatbots. Gemini feels equally capable in terms of raw ability, and it responds very fast, but it doesn’t quite have the feature set. Google Gemini holds much promise, but we’ll need to wait to see how available that integration of Google’s AI is before commenting further.

Not sure which AI tool to use? Have a look at our roundup of the best AI chatbots.

Related Posts

New study shows AI isn’t ready for office work

A reality check for the "replacement" theory

Google Research suggests AI models like DeepSeek exhibit collective intelligence patterns

The paper, published on arXiv with the evocative title Reasoning Models Generate Societies of Thought, posits that these models don't merely compute; they implicitly simulate a "multi-agent" interaction. Imagine a boardroom full of experts tossing ideas around, challenging each other's assumptions, and looking at a problem from different angles before finally agreeing on the best answer. That is essentially what is happening inside the code. The researchers found that these models exhibit "perspective diversity," meaning they generate conflicting viewpoints and work to resolve them internally, much like a team of colleagues debating a strategy to find the best path forward.

Microsoft tells you to uninstall the latest Windows 11 update

https://twitter.com/hapico0109/status/2013480169840001437?s=20