Artificial intelligence keeps getting more intelligent.
Two years ago, the Google Brain team began employing machine learning techniques to teach a computer how to interpret and caption images. Sure, it won’t win any humor contests for being punny or particularly clever, but if you’re looking for a literal translation of what you’re looking at, Google’s AI system has you covered.
On Thursday, the internet giant announced that it had made “the latest version of our image captioning system available as an open source model in TensorFlow.” The most recent iteration of its AI “contains significant improvements to the computer vision component of the captioning system, is much faster to train, and produces more detailed and accurate descriptions compared to the original system,” Google said.
Called “Show and Tell,” the algorithm can recognize objects in imagery with an impressive 93.9 percent accuracy rate. That’s quite the improvement from just two years ago, when the AI was still scoring in the B-range, identifying images correctly just 89.6 percent of the time. So what’s changed? In essence, Google’s tool now tries to describe objects rather than simply classifying them.
“For example, an image classification model will tell you that a dog, grass and a Frisbee are in the image,” Google noted, “But a natural description should also tell you the color of the grass and how the dog relates to the Frisbee.”
While you may not need Google to tell you what you’re looking at on a daily basis, these machine learning capabilities could be used to help those with visual impairments, and further the work of other AI researchers. “We hope that sharing this model in TensorFlow will help push forward image captioning research and applications, and will also allow interested people to learn and have fun,” Google said.
For a full description of Google’s latest algorithm, check out “Show and Tell: Lessons learned from the 2015 MSCOCO Image Captioning Challenge,” published in IEEE Transactions on Pattern Analysis and Machine Intelligence.
Related Posts
How to change margins in Google Docs
You can easily change the left, right, top, and bottom margins in Google Docs and have a few different ways to do it.
What is Microsoft Teams? The Slack rival does things other collaborative tools can’t
With Microsoft Teams, you're able to chat, video conference, share documents and edit them together, and easily coordinate schedules and workflows. Recently, Microsoft Teams did get a price bump for Microsoft Personal users (from $7 to $10), and the company also added a wave of AI agents into the mix.
Microsoft Word vs. Google Docs
However, using Google Docs proves it still has a long way to go before it can match all of Word's features -- Microsoft has been developing its word processor for over 30 years, after all, and millions still use Microsoft Word. Will Google Docs' low barrier to entry and cross-platform functionality win out? Let's break down each word processor in terms of features and capabilities to help you determine which is best for your needs. How does each word processing program compare? To put it lightly, Microsoft Word has an incredible advantage over Google Docs in terms of raw technical capability. From relatively humble beginnings in the 1980s, Microsoft has added new tools and options in each successive version. Most of the essential editing tools are available in Google Docs, but users who are used to Word will find it limited.