Artificial intelligence is the science of making machines that can think like humans. It can do things that are considered "smart."
AI technology can process large amounts of data in ways, unlike humans. The goal for AI is to be able to do things such as recognize patterns, make decisions, and judge like humans.
This December of 2023, Google announced the release of Gemini, in the words of Demis Hassibis, the CEO of Google DeepMind, Gemini is “Google’s largest and most capable AI model.
Gemini can understand the world around us, in the way that we do. so not just text but also code, audio, image, and video”
According to Google and Alphabet CEO Sundar Pichai, “AI has the potential to create opportunities — from the everyday to the extraordinary — for people everywhere. It will bring new waves of innovation and economic progress and drive knowledge, learning, creativity and productivity on a scale we haven’t seen before.
That’s what excites me: the chance to make AI helpful for everyone, everywhere in the world.”
The following are some of the features that Google Gemini brings to surprise us and change the way we interact with machines and knowledge.
Gemini reads and extracts data from hundreds of thousands of relevant scientific articles in a few minutes, can generate tables of information, and being multimodal can make, for example, graphs. And very important can give you the source of the information, a very important matter that is especially important in science.
This is a time-consuming job that investigators do very often.
Google Gemini interacts with the user to obtain more specific information about their search, presenting multimodal information such as text, images, and videos until the answers perfectly match the user's needs.
Explaining reasoning in math and physics
A feature of Gemini that surprised us is the ability to read an exam or a handwritten worksheet, identify which exercises are well solved and which are not, and from these clarify the wrong concepts.
In addition to the above, we can ask Gemini to suggest more similar exercises to continue studying. Very useful for parents who have to help their children with their homework or exams.
Processing raw audio
Gemini can understand audio files with delicate nuances such as voices or pronunciation, this gives more richness to the questions we can ask and the answers it generates.
For example, summarizing a podcast that may be on a complicated topic or maybe in another language, or asking how to pronounce a word correctly in another language.
Gemini works together with image, audio, and text so we can ask questions with our voice, present images to be more specific or write other types of information, Gemini will navigate between the information presented in different ways to give a more accurate answer.
Some differences between Chat GPT and Google Gemini
Unlike GPT-4 chat, Google Gemini gives us a one-click reference to where the information was taken from.
Another difference is the integration that Gemini will have with Google products such as Gmail, YouTube, Google Docs, Google Drive, spreadsheets, and other services.
According to Wired.com Google says Gemini will be made available to developers through Google Cloud’s API from December 13. A more compact version of the model will from today power suggested messaging replies from the keyboard of Pixel 8 smartphones. Gemini will be introduced into other Google products including generative search, ads, and Chrome in “coming months,” the company says. The most powerful Gemini version of all will debut in 2024.
Google Gemini comes to initiate a digital era that just a few years ago we only saw in science fiction movies, multimodal artificial intelligence, i.e. understands not only text but also audio, images, and code, machines and programs thinking like humans and as good as the greatest experts in the world of each subject.
The purpose and vision of Google are to bring opportunities through innovation and economic progress. This new year 2024 we will all mankind to evaluate if the great of the search engines achieves it.
If you want to read more blogs about AI you can visit: