Skip to main content

All of the internet now belongs to Google’s AI

Google Bard being shown off at Google I/O 2023.
Image used with permission by copyright holder

Google’s latest update to its privacy policy will make it so that the company has free range to scrape the web for any content that can benefit building and improving its AI tools.

“Google uses information to improve our services and to develop new products, features, and technologies that benefit our users and the public,” the new Google policy says. “For example, we use publicly available information to help train Google’s AI models and build products and features like Google Translate, Bard, and Cloud AI capabilities.”

Gizmodo notes that the policy has been updated to say “AI models” when it previously said, “for language models.” Additionally, the policy added Bard and Cloud AI, when it previously only mentioned Google Translate, for which it collected data.

The privacy policy, which was updated over the weekend, appears especially ominous because it indicates that any information you produce online is up for grabs for Google to use for training its AI models.

The aforementioned wording seems to describe not just those in the Google ecosystem in one way or another but is detailed in such a way that the brand could have access to information from any part of the web.

Major issues surrounding the mass development of artificial intelligence are questions about privacy, plagiarism, and whether AI can dispel correct information. Early versions of chatbots such as ChatGPT are based on large language models (LLMs) that used already public sources, such as the common crawl web archive, WebText2, Books1, Books2, and Wikipedia as training data.

Early ChatGPT was infamous for becoming stuck on information beyond 2021 and subsequently filling in responses with false data. This could likely be one of the reasons Google would want unfettered access to web data to benefit tools such as Bard, to have real-world and potentially real-time training for its AI models.

Gizmodo also noted that Google could use this new policy to collect old, but still human-generated content, such as long-forgotten reviews or blog posts, to still have a feel of how human text and speech is developed and distributed. Still, it remains to be seen exactly how Google will use the data it collects.

Several social media platforms, including Twitter and Reddit, which are major sources of up-to-date information have already limited their public access in the wake of AI chatbot popularity, to the chagrin of their entire communities.

Both platforms have closed free access to their APIs, which restricts users from downloading massive amounts of posts for sharing elsewhere, under the guise of protecting their intellectual property. This instead broke many of the third-party tools that make both Twitter and Reddit run smoothly.

Both Twitter and Reddit have had to deal with other setbacks and controversies as their owners’ concerns heighten about AI taking over.

Editors' Recommendations

Fionna Agomuoh
Fionna Agomuoh is a technology journalist with over a decade of experience writing about various consumer electronics topics…
OpenAI and Microsoft sued by NY Times for copyright infringement
A phone with the OpenAI logo in front of a large Microsoft logo.

The New York Times has become the first major media organization to take on AI firms in the courts, accusing OpenAI and its backer, Microsoft, of infringing its copyright by using its content to train AI-powered products such as OpenAI's ChatGPT.

In a lawsuit filed in Federal District Court in Manhattan, the media giant claims that “millions” of its copyrighted articles were used to train its AI technologies, enabling it to compete with the New York Times as a content provider.

Read more
Microsoft Copilot: tips and tricks for using AI in Windows
Microsoft Copilot allows you to ask an AI assistant questions within Office apps.

Microsoft's Copilot might not be breaking ground in quite the same way as ChatGPT seemed to when it first debuted, but there are still some useful abilities for this desktop-ready chatbot AI that is now available to pretty much anyone running the latest version of Windows 11. It doesn't have a huge range of abilities yet, confining itself to changing some Windows settings, opening apps for you, and performing the creative writing and web search functions available through its contemporaries.

But you can make Copilot work for you and work well, and there are some tips and tricks you'll want to employ to make the most of it. Here are some of my favorites.
Go hands-free
While the latest natural language AIs might be primarily text-based, many of them now include voice and audio support, and Windows Copilot is much the same. While this might seem like merely a more clunky way to interact with Copilot -- and it is kind of clunky -- this is an important feature because it means you don't have to use your hands to prompt it. Beyond clicking the little microphone button, you can get back to whatever you're doing while asking it a question or requesting something from it.

Read more
2023 was the year of AI. Here were the 9 moments that defined it
A person's hand holding a smartphone. The smartphone is showing the website for the ChatGPT generative AI.

ChatGPT may have launched in late 2022, but 2023 was undoubtedly the year that generative AI took hold of the public consciousness.

Not only did ChatGPT reach new highs (and lows), but a plethora of seismic changes shook the world, from incredible rival products to shocking scandals and everything in between. As the year draws to a close, we’ve taken a look back at the nine most important events in AI that took place over the last 12 months. It’s been a year like no other for AI -- here’s everything that made it memorable, starting at the beginning of 2023.
ChatGPT’s rivals rush to market

Read more