Why captions are everywhere on TikTok: ‘Glasses for your ears’

Influencer Alyssa McKay is surrounded by lighting equipment.
Influencer Alyssa McKay, 21, poses behind the scenes of one of the rap videos she regularly posts to TikTok.
(Levi Underhill)

TikTok creator Alyssa McKay has perfected a persona people love to hate: a Regina George-esque rich girl whose bouncy raps are packed with in-jokes about Met Gala invites and counterfeit Dolce & Gabbana purses.

But despite their fun lyrics and high production values, her sketches are far from guaranteed to hold the interest of her 8.7 million followers, who may be scrolling their feeds with their phones on mute while at work or watching TV, or whose English might not be fluent enough to keep up with her torrent of references and disses.

For McKay, the solution to many of these obstacles is the same: subtitles.

“Our attention spans are so short now that I feel like closed captioning, and putting the captions up on the screen if closed captioning isn’t an option, is so crucial,” said McKay, 21, who lives in Portland, Ore. “It captures the attention of people, and they’re able to actually process what’s going on.”

A video app known primarily as a place to watch dances and lip-syncs set to popular songs, TikTok has emerged as an unlikely forum for text in the form of its increasingly ubiquitous captions. Open the app and you’ll see them everywhere: overlaid atop memes, embedded in stand-up comedy clips, flashing by in movie trailers. Users can write their own — in a number of eye-catching fonts and colors — or, as of April, let TikTok auto-generate them. The platform’s closed captioning hashtag, #cc, sits at more than 4 billion views.

That might seem surprising. After all, subtitles are a notoriously big obstacle for American movie audiences, who rate foreign-language films the lowest of all film genres. And the Gen Z-ers who make up TikTok’s main demographic grew up on an internet rich with video and audio, where reading was strictly optional.


Although there’s a drift in digital technologies away from text and toward “a more multimedial representation of reality,” UC Davis communications professor Martin Hilbert said, TikTok subtitles complement rather than replace the app’s core video elements.

“It is part of a general trend to more fully represent reality,” Hilbert said, citing an academic framework called media richness theory.

TikTok isn’t the first social network to overlay words on videos. YouTube launched automatic captioning in 2009; the thin black strip of text that Snapchat lets users paste over their photos and videos is perhaps the most recognizable of the app’s ever-increasing list of features. Soon after TikTok launched its auto-captioning tool, Instagram released one of its own.

And the trend isn’t confined to social media: Watching television with subtitles appears to be increasingly commonplace, for reasons as varied as sleeping housemates, ambient background noise, dialogue-muffling sound mixes and the wider range of accents in mainstream entertainment.

But on TikTok, a mix of factors — including the incentive structures that motivate influencers to produce certain types of content, widespread shifts in how people consume online media and the successful advocacy of disability rights groups — have all come together to make subtitles and captions a particularly prominent feature of the most downloaded app in the world last year.

Meet Lynette Adkins. She’s part of a new crop of creators focusing their content on the how-to of the business: making money from an online following.

Aug. 12, 2021

McKay said she began writing captions for her TikToks last summer, around the same time she started to think about her presence on the platform as an actual career. When TikTok launched the auto-captioning tool, she quickly latched on to it.

It’s not just a strategy she uses for her music videos either. More traditional influencer post types — an outfit montage; a reaction video — come with subtitles of their own, sometimes hand-typed and other times auto-generated. (Closed captions are those that can be turned on and off. Only the auto-generated ones fit that bill; those that creators write themselves are open captions.) Some of her posts don’t include any speaking at all.

McKay sees text as serving several functions, such as helping fans appreciate the “little innuendos” she slips into her lyrics and serving as a sort of teaser for new viewers.

“When you’re scrolling through and you see this big text above someone’s head,” she said, “you’re like, ‘Oh, what is that? What does that say?’”

From TikTok tarot to remote reiki, COVID-19 drove mediums, clairvoyants and other “spirit workers” online.

July 1, 2021


Overlaying videos with text may even encourage TikTok’s opaque recommendation algorithm to boost their visibility, she said — although it’s hard to know for sure.

Brian Mandler, who manages McKay through his marketing agency the Network Effect, said that part of the appeal of subtitled TikToks is that they feel like “modern-day karaoke,” allowing fans to sing along with McKay or make their own videos using the same audio. A YouTube compilation he shared shows more than a dozen accounts replicating one of McKay’s raps.

Captions, he said, may also direct someone’s focus to a single piece of content if they have multiple devices open in front of them — a state of distraction called “continuous partial attention.” “In order to watch the video, hear the sound, but then also read the captions, you know that their focus is going to be on that device,” Mandler said.

Dan Greenberg, president of the ad exchange Sharethrough, said company data show that 75% of people overall, and 86% of millennials, keep their phones muted throughout the day. That means that if they ever encounter a video — be it a post by an influencer such as McKay or an ad from one of Sharethrough’s clients — they’ll probably need captions to understand it.

“If there’s a video that you’re watching on your phone and it’s on mute, and you don’t really want to turn your sound on, and there’s no captions, you generally just scroll past it,” Greenberg said. “That video just blends into the background.”

It’s a shift in how people engage with content that many advertisers haven’t yet caught up with, he said: “Most marketers are still pushing the traditional 15s and 30s [commercials] from TV, without captions, that are mostly incomprehensible when the phone is on mute.”

Text-heavy TikToks are also easier to share across other platforms because they can be screen-shot without losing as much context.

And the frenetic pacing and rapid-fire dialogue used in many TikToks may also be making captions increasingly important on the platform.

Paula Winke, a Michigan State University linguistics professor who has researched the educational benefits of captioning, described captions as “glasses for your ears” that can make it easier to parse dialogue. That’s especially important when it comes to online videos that have had speakers’ natural pauses edited out, depriving viewers’ brains of the momentary rest they need to fully absorb what’s being said.

Captions can also make social media content “more accessible to a larger, global market,” Winke said via email. That means more fans for influencers and more users for TikTok.

Of course, the prevalence of captioning on TikTok and in other parts of the internet isn’t just a question of capturing market share in the attention economy; it’s in large part the result of advocacy of accessibility for the deaf and hard of hearing. TikTok has said it works with creators and disability advocates to develop accessibility tools such as the auto-captioning feature.

“The accessibility movement and the convenience of using speech-to-text [and] speech-recognition tools are leading factors in seeing more short videos captioned,” Eric Kaika, chief executive of the accessibility group TDI (previously Telecommunications for the Deaf and Hard of Hearing), said by email. But, he added, it’s not yet a universal trend.

Howard A. Rosenblum, CEO of the National Assn. of the Deaf, agreed: However commonplace captioned video has become, social media is still a challenging place for deaf and hard-of-hearing users.

“Most social networks do not make accessibility features easy or universal,” Rosenblum said via email. “Such networks should have dedicated accessibility teams on their payrolls that include people with disabilities to test every feature or program to pass universal design requirements before any launch.”

There’s still a ways to go before all online content comes with high-quality captioning. But if TikTok is any indication, there’s substantial interest in getting to that point — not just from accessibility advocates, but also from content creators and social media-savvy consumers.