Recommended Advertising on YouTube: Mapping Demographic Variation and Targeting Strategies in Italy

Team Members

Ellen Gustafsson, Federico Irace, Kata Horváth, Lawrence Paul Cabrera, Nicole Sanchotene, Andrea Russo (Project Leader)

1. Introduction to the project

Digital platforms frame the consumption of content, both directly and through their infrastructures, sustaining the distribution of targeted advertisements. This project aims to map patterns of advertisements on YouTube, focusing on how targeting strategies vary across age, gender, location and education, through an approach informed by digital methods. Students will draw from a dataset of ads recommended to users on Italian YouTube, which includes the date the ad was served and the user to whom it was recommended. Through a qualitative-quantitative approach, the project aims to analyze how advertisements vary across different demographics and examine the degree of personalization in online advertising, considering whether it operates through individual targeting or broader demographic categories, such as gender or age. The project also explores which kinds of advertisements can be considered ‘omnivorous’ (i.e., aimed at all consumers indiscriminately) and which are ‘specialized’ (i.e., directed toward a specific segment of the population) in their targeting strategies.

2. Research Questions

What kind of advertisements are different categories of demographics exposed to on YouTube?
What are the strategies behind firms’ ad targeting choices?

3. Methodology

Sample

The primary source of data originated from the AlgoFeed Project (https://algofeed.unimi.it/), which provides a structured infrastructure for collecting online platform data through voluntary data donation by users. This approach enabled us to capture a broad and ecologically valid pool of YouTube video advertisements as they appeared in real user environments. The sample of advertisements was developed through voluntary contributions from Italian YouTube users participating in AlgoFeed’s data donation project.
From the broader data collected via AlgoFeed, we were given a random sample of 500 YouTube advertisements for in-depth analysis. The dataset consists of a sample of YouTube advertisements that users reported encountering on the platform. For each video, the dataset includes metadata such as the video ID, URL, and title, along with additional information like duration and automatically extracted named entities. The most important element for our analysis is the categorical tag that we assigned to each video, indicating the intent of the advertisement (e.g., Brand) or the sector of the promoted product, service, or experience (e.g., Food, Fashion, Technology).
This means the dataset combines both structured information (video identifiers, durations, tags) and semi-structured content (titles and entities), allowing us to examine advertisements not only by their descriptive attributes but also by how they are positioned within broader sociological categories of consumption and promotion.

Tagging

Following the collection of the random sample of 500 YouTube advertisements without any accompanying sociodemographic data, the next step of the methodology involved the development of a codebook for systematic categorization.
To analyze these ads systematically, we developed a tagging scheme that allowed us to categorize each video according to its primary purpose and content. Our first distinction was based on intent: we separated ads that aimed to promote a company’s activity and overall identity from those that advertised a specific product, service, or experience. The former were grouped under the category “Brand”, capturing cases where the emphasis was on the business itself, its reputation, recognition, or values, rather than on a concrete offering.
For the remaining advertisements, we adopted a sectoral approach, assigning tags according to the use and type of product, service, or experience being promoted. These categories may indeed reflect the different domains of consumption and cultural practice that advertising engages with. In this way, our tagging scheme combines both an intent-based distinction and a content-based classification, allowing for a meaningful analysis of how advertising may target different aspects of everyday life.

TagDescription
BrandAds that highlight the business and overall image, values, or identity of a company rather than selling a specific product or service.   e.g., L’ORÉAL PARIS | Porta in Scena il Tuo Valore
CarCovers advertisements for automobiles, motorcycles, and other motorized means of transport.   e.g., Hyundai Tucson HEV 15s NoPromo 1080×1920 Reels NoLogo
ConsumableRefers to everyday items that are quickly used and regularly replaced.   e.g., POLVERE SU TUTTI I MOBILI? NON CON SWIFFER DUSTER! | SWIFFER. SE LO PROVI, TI CATTURA!
Digital ProductApplies to goods or services that exist online, such as apps, software, or online services.   e.g., Work Anywhere Device IT    Adobe Document Cloud
DomesticCovers durable household goods and furnishings that are typically longer-term investments.   e.g., DIVANO 15 SEC YOUTUBE
E-commercePromotions from online shopping platforms.   e.g., Amazon Shipping (IT) 2023
EventAds for specific happenings such as exhibitions or fashion shows.   e.g., KENZO SS24 RUNWAY SHOW BY NIGO HIGHLIGHT
FashionClothing, accessories, and style-related promotions.   e.g., PINKO presents Deepest Desires – SS24 Shoes Collection #PINKOSS24 #PINKODeepestDesires #Shoes
FoodFood and beverage-related products.   e.g., Nuovo Lindt Choco Wafer
HobbyProducts and services linked to leisure and personal interests (e.g., travel, cards, board games). Ads appeal to creativity, enjoyment, and lifestyle enrichment.   e.g., Bring on Ecuador with G Adventures!
MediaRefers to visual media products and services such as television shows, films, and online videos.   e.g., The Killer | Netflix Italia
MusicCovers songs, albums, and music streaming platforms.   e.g., Kollektiv Turmstrasse: YAP Short Clip
No DataUsed when the available information about the video was incomplete, unclear, or impossible to classify.   e.g., ios_gg_450ac250bc55f27ca9e23f34300d6686_video_0915L007_IT_iOS_GI_Expand_Video_0911.mp4
Self CareProducts and services for personal wellbeing, beauty, and health.   e.g., Minéral 89 Crema Idratante 72h
SportCovers advertisements related to sports as a form of entertainment, focusing on events or sports broadcasting.   e.g., F1® is in the air 🇶🇦🏎️
TechnologyCovers advertisements for technological devices and hardware that are primarily physical products.   e.g., Galaxy Z Flip5 | Samsung
Video GamePromotions for video games, consoles, or gaming services.   e.g., Game of Empires: Warring Realm

Analysis

Only after completing the coding process did we reintroduce sociodemographic data from the AlgoFeed participants into the analysis. This sequencing was a deliberate methodological choice aimed at reducing bias and avoiding circular reasoning. By first working exclusively with the advertisement sample on its own terms, we avoided the temptation to let preconceptions about targeting practices or user profiles shape the coding of ad content. When demographic data such as age groups, gender distribution, or education levels were finally considered, it was solely as a secondary interpretive layer. This allowed us to examine whether certain types of advertisements clustered more strongly around specific user demographics, while ensuring that the foundational categorization of ads remained independent of these factors.

ProvinceN
Naples231
Milan60
Varese48
Cremona37
Bergamo36
Avellino20
Pavia13
Caserta13
Salerno10
Monza and Brianza10
Lecco7
Benevento6
Brescia6
Como1
Lodi1
RegionN
Campania280
Lombardy219
Parents EducationN
High School Diploma229
Middle School Diploma166
Bachelor’s Degree34
Vocational Qualification31
Master’s Degree (or Single Cycle)15
PhD13
Postgraduate Master (I or II level)10
No Degree1
Education LevelN
High School Diploma247
Bachelor’s Degree162
Master’s Degree (or Single Cycle)54
Middle School Diploma22
PhD11
Postgraduate Master (I or II level)3
Age GroupN
35–40231
25–34136
18–24132
GenderN
Female295
Male204

4. Findings

After completing the stages of classification and analysis, we observed that the advertisements, when grouped by category, were distributed as shown in the table below.

TagN
No data96
Brand92
Media50
Food47
Self care39
E-commerce32
Digital product27
Domestic21
Fashion20
Technology20
Video game20
Consumable10
Car6
Hobby6
Event6
Sport4
Music3
Home (error)1

The distribution of gender across advertisement tags shows that Female and Male users appear in roughly similar proportions across most categories, though females are slightly more represented in No data, Brand, Media, and Fashion, and significantly more in Self care, E-commerce, and Food. Males are somewhat more represented in Domestic and Video games. Many categories, especially those with lower counts such as Sport, Music, show balanced and/or low representation between genders. The Brand tag, defined as ads that highlight the overall image, values, or identity of a company rather than selling a specific product, is present for both male and female groups, with a tilt toward females. Most significant differences, with a higher representation of females is visible in the categories Food, Self care, and E-commerce, and for male in Domestic and somewhat in Video Game. Notable is that females are targeted more times with ads in Technology.

Age-based distributions showed clustering of ads. The 18–24 and 25–34 groups appeared equally in multiple categories such as Media, Food, Self Care, Hobby, and Event. Ads in Technology and Car categories were more evenly distributed between 25–34 and 35–40 . Only E-commerce is exposed almost equally to the three age groups.

The distribution of education shows that High School Diploma and Bachelor’s Degree dominate across multiple tags, especially in Brand and Fashion. Ads in Technology and Video Games also show contributions from Master’s Degrees, while Middle School Diploma and PhD are less common. The Sport and Consumable categories are more evenly distributed across lower and mid-level educational backgrounds. Overall, educational segmentation is present, with higher education levels more represented in technology-related categories and lower levels in lifestyle-related ones.

Parental education is most frequently reported as High School Diploma or Middle School Diploma across nearly all categories. These dominate in No data, Brand, Media, Self care, Car and Food. Only in the Food category do individuals with a middle school diploma dominate the ads they are exposed to. Higher parental education (Bachelor’s Degree, Master’s Degree, PhD) appears in smaller proportions, more visible in Technology, Video games. The higher educational level of Masters Degree is only represented in Brand, Media, Food, Technology, and Video Game.

When looking at regional distribution, Campania and Lombardy are the two regions represented. Campania has higher counts in No data, Media, Food, E-commerce, Digital Product, Domestic, Technology, Video Game, Fashion, and Hobby. Lombardy is more represented in Brand, Car, Sport, and Music. Categories like Fashion and Consumable are relatively evenly distributed between the two.

At the provincial level, Naples dominates the distribution across nearly all categories, followed by Milan. Other provinces such as Caserta, Salerno, Avellino, and Cremona appear but with much smaller counts. Naples has particularly high representation in No data, Brand, Media, Food, Self Care, Digital Product, Technology, Video Game and Consumable. Milan appears most significantly in Brand, Self Care and Consumable. Overall, the data shows a strong concentration of ads in a few provinces, with many provinces barely represented.

5. Discussion

The analyzed sample revealed a predominance of advertisements aimed at institutional brand promotion, rather than campaigns exclusively focused on the immediate sale of products or services. This finding suggests that advertising on YouTube, for the audiences considered, largely functions as a strategy to reinforce brand reputation and visibility, operating as a form of continuous presence in the digital environment. Rather than solely stimulating direct consumption, these advertisements appear to serve the purpose of reminding users of the brand’s existence and relevance, consolidating its position in the collective imagination while fostering familiarity and trust over time. This dynamic aligns with Carah and Brodmerkel’s (2021) observation that brands engage with, shape, and capitalize on “algorithmic culture,” leveraging platform infrastructures not only to sell products but to sustain cultural visibility and authority.
Two further points emerge from the distributional patterns observed across tags once sociodemographic variables were introduced post-coding, with implications for how personalization appears to operate on Italian YouTube. First, several categories that might be expected to be strongly gendered, such as Technology or Fashion, showed only modest skews rather than sharp separations, while “omnivorous” classes like Food and Media remained broadly distributed. Second, the prevalence of the Brand and No Data tags highlights interpretive limits but also analytic opportunities. The Brand tag’s prominence may indicate a strategic emphasis on reputation and trust-building in a cluttered attention economy, while the No Data bucket – kept deliberately to ensure coding transparency in our analysis – acts as a reminder that classification granularity is co-determined by creative disclosures, metadata quality, and extraction pipelines, not only by coder judgment.

6. Conclusion

Unlike television, for instance, where the entire audience receives the same advertisements, on the internet, ads are “tuned” to each user based on socio-psychological profiles built in real time through data collection (Carah et al. 2024; Beauvisage et al. 2024). The analyses carried out make it possible to assess, at least partially, how demographic criteria — such as age, gender, or education — are mobilized to promote advertisements to specific audiences. This is a relevant contribution, given that it is currently not possible to precisely observe how the distribution of ads occurs for each individual on the platform. An important limitation, however, is that the research cannot capture the full range of parameters used by recommendation algorithms, which likely include contextual and behavioral variables in addition to the demographic data analyzed. Future research could address this challenge by combining broader datasets, controlled browsing experiments, or algorithmic auditing techniques, thereby enabling a more detailed understanding of how different dimensions of user profiles influence the personalization and distribution of advertisements.

7.   References

– Beauvisage, T., Beuscart, J. S., Coavoux, S., & Mellet, K. (2024). How online advertising targets consumers: The uses of categories and algorithmic tools by audience planners. New Media & Society, 26(10), 6098-6119. https://journals.sagepub.com/doi/10.1177/14614448221146174

– Carah, N., and Brodmerkel, S. (2020). Critical perspectives on brand culture in the era of participatory and algorithmic media, Sociology Compass, 2-12. https://doi.org/10.1111/soc4.12752

– Carah, N., Hayden, L., Brown, M. G., Angus, D., Brownbill, A., Hawker, K., .& Robards, B. (2024). Observing “tuned” advertising on digital platforms. Internet Policy Review, 13(2), 1-26. https://policyreview.info/articles/analysis/observing-tuned-advertising-digital-platforms

– Punziano, G.; Gandini, A.; Caliandro, A.; Airoldi, M.; Padricelli, G.M.; Acampa, S.; Trezza, D.; Crescentini, N.; Rama, I. 2024. Eliciting and Retrieving the Feedback-Loop. Exploring Elicitation Interview Techniques for Detecting Algorithmic Feedback on Social Media and Cultural Consumption. In: 6th International Conference on Advanced Research Methods and Analytics (CARMA 2024). Valencia, 26-28 June 2024. https://doi.org/10.4995/CARMA2024.2024.17835

– Punziano, G.; Gandini, A.; Caliandro, A.; Airoldi, M.; Padricelli, G.M.; Acampa, S.; Trezza, D.; Crescentini, N.; Rama, I. 2024. The Algofeed project. A methodological proposal to assessing the effects of algorithmic recommendations on platformized consumption.. In: 6th International Conference on Advanced Research Methods and Analytics (CARMA 2024). Valencia, 26-28 June 2024. https://doi.org/10.4995/CARMA2024.2024.17834