Business

8 Google Employees Invented Modern AI. Here’s the Inside Story

Byadmin March 20, 2024

The last two weeks before the deadline were frantic. Though officially some of the team still had desks in Building 1945, they mostly worked in 1965 because it had a better espresso machine in the micro-kitchen. “People weren’t sleeping,” says Gomez, who, as the intern, lived in a constant debugging frenzy and also produced the visualizations and diagrams for the paper. It’s common in such projects to do ablations—taking things out to see whether what remains is enough to get the job done.

“There was every possible combination of tricks and modules—which one helps, which doesn’t help. Let’s rip it out. Let’s replace it with this,” Gomez says. “Why is the model behaving in this counterintuitive way? Oh, it’s because we didn’t remember to do the masking properly. Does it work yet? OK, move on to the next. All of these components of what we now call the transformer were the output of this extremely high-paced, iterative trial and error.” The ablations, aided by Shazeer’s implementations, produced “something minimalistic,” Jones says. “Noam is a wizard.”

Vaswani recalls crashing on an office couch one night while the team was writing the paper. As he stared at the curtains that separated the couch from the rest of the room, he was struck by the pattern on the fabric, which looked to him like synapses and neurons. Gomez was there, and Vaswani told him that what they were working on would transcend machine translation. “Ultimately, like with the human brain, you need to unite all these modalities—speech, audio, vision—under a single architecture,” he says. “I had a strong hunch we were onto something more general.”

In the higher echelons of Google, however, the work was seen as just another interesting AI project. I asked several of the transformers folks whether their bosses ever summoned them for updates on the project. Not so much. But “we understood that this was potentially quite a big deal,” says Uszkoreit. “And it caused us to actually obsess over one of the sentences in the paper toward the end, where we comment on future work.”

That sentence anticipated what might come next—the application of transformer models to basically all forms of human expression. “We are excited about the future of attention-based models,” they wrote. “We plan to extend the transformer to problems involving input and output modalities other than text” and to investigate “images, audio and video.”

A couple of nights before the deadline, Uszkoreit realized they needed a title. Jones noted that the team had landed on a radical rejection of the accepted best practices, most notably LSTMs, for one technique: attention. The Beatles, Jones recalled, had named a song “All You Need Is Love.” Why not call the paper “Attention Is All You Need”?

The Beatles?

“I’m British,” says Jones. “It literally took five seconds of thought. I didn’t think they would use it.”

They continued collecting results from their experiments right up until the deadline. “The English-French numbers came, like, five minutes before we submitted the paper,” says Parmar. “I was sitting in the micro-kitchen in 1965, getting that last number in.” With barely two minutes to spare, they sent off the paper.

Business

Deadspin’s New Owners Are Embracing Betting Content—but Not AI

Byadmin May 15, 2024

Deadspin’s new ownership comes at a time when sports media is increasingly entwined with sports betting. Most major outlets, including ESPN, NBC, CBS, The Ringer, The Athletic, and Bleacher Report, have partnered with betting companies. What once might have been eyebrow-raising is increasingly accepted as standard practice, although some outliers, like Defector, still raise alarms…

Business

Sam Altman’s Sudden Exit Sends Shockwaves Through OpenAI and Beyond

Byadmin November 18, 2023

Earlier this month, Altman hosted OpenAI’s first developer conference and announced a plan to create an app store for AI agents built on top of its technology. Altman was also courting Middle East sovereign wealth funding for the development of AI chips that would compete with Nvidia’s, according to Bloomberg. The Information has previously reported…

Business

The Big-Tech Clean Energy Crunch Is Here

Byadmin June 3, 2024

Big Tech’s appetite for energy is just about visible from the east coast of Scotland. Some 12 miles out to sea sits a wind farm, where each of the 60 giant turbines has blades roughly the length of an American football field. The utility companies behind the Moray West project had promised the site would…

Business

Trump Squeezed America’s Geek Squad. Biden Built It Back Stronger

Byadmin September 8, 2023

Mina Hsiang returned to the United States Digital Service, the US government’s rapid digital fix-it squad, on January 26, 2021, when the streets of Washington, DC, had hardly been cleared after Joe Biden’s inauguration. She was one of the group’s founding members but had spent the past few years working for a health care startup….

Business

McDonald’s Ice Cream Machine Hackers Say They Found the ‘Smoking Gun’ That Killed Their Startup

Byadmin December 15, 2023

A little over three years have passed since McDonald’s sent out an email to thousands of its restaurant owners around the world that abruptly cut short the future of a three-person startup called Kytch—and with it, perhaps one of McDonald’s best chances for fixing its famously out-of-order ice cream machines. Until then, Kytch had been…

Business

China Tried to Keep Kids Off Social Media. Now the Elderly Are Hooked

Byadmin November 25, 2023

This part of the elderly population can live limited lives, moving between just three physical locations: the place they go to buy groceries, where they drop their children off from school, and their own community complex. Huang’s mother didn’t move far. Originally from Jiangxi, she moved to Zhejiang, a six-hour drive away, and the living…

Similar Posts