Happy Valentine's Day! This week we're exploring Valentine's Day survey data. The National Retail Federation in the United States conducts surveys and has created a Valentine's Day Data Center so you can explore the data on how consumers celebrate.
The NRF has surveyed consumers about how they plan to celebrate Valentine’s Day annually for over a decade. Take a deeper dive into the data from the last 10 years, and use the interactive charts to explore a demographic breakdown of total spending, average spending, types of gifts planned and spending per type of gift.
The NRF has continued to collect data. The data for this week is from 2010 to 2022, as organized by Suraj Das for a Kaggle dataset. In the historical surveys gender was collected as only 'Men' and 'Women', which does not accurately include all genders.
If you're looking for other Valentine's Day type datasets, check out previous datasets on chocolate or board games (a good Valentine's Day activity!).
# Option 1: tidytuesdayR package
## install.packages("tidytuesdayR")
tuesdata <- tidytuesdayR::tt_load('2024-02-13')
## OR
tuesdata <- tidytuesdayR::tt_load(2024, week = 7)
historical_spending <- tuesdata$historical_spending
gifts_age <- tuesdata$gifts_age
gifts_gender <- tuesdata$gifts_gender
# Option 2: Read directly from GitHub
historical_spending <- readr::read_csv('https://raw.githubusercontent.com/rfordatascience/tidytuesday/master/data/2024/2024-02-13/historical_spending.csv')
gifts_age <- readr::read_csv('https://raw.githubusercontent.com/rfordatascience/tidytuesday/master/data/2024/2024-02-13/gifts_age.csv')
gifts_gender <- readr::read_csv('https://raw.githubusercontent.com/rfordatascience/tidytuesday/master/data/2024/2024-02-13/gifts_gender.csv')
- Explore the data, watching out for interesting relationships. We would like to emphasize that you should not draw conclusions about causation in the data. There are various moderating variables that affect all data, many of which might not have been captured in these datasets. As such, our suggestion is to use the data provided to practice your data tidying and plotting techniques, and to consider for yourself what nuances might underlie these relationships.
- Create a visualization, a model, a shiny app, or some other piece of data-science-related output, using R or another programming language.
- Share your output and the code used to generate it on social media with the #TidyTuesday hashtag.
variable | class | description |
---|---|---|
Year | double | Year |
PercentCelebrating | double | Percent of people celebrating Valentines Day |
PerPerson | double | Average amount each person is spending |
Candy | double | Average amount spending on candy |
Flowers | double | Average amount spending on flowers |
Jewelry | double | Average amount spending on jewelry |
GreetingCards | double | Average amount spending on greeting cards |
EveningOut | double | Average amount spending on an evening out |
Clothing | double | Average amount spending on clothing |
GiftCards | double | Average amount spending on gift cards |
variable | class | description |
---|---|---|
Age | character | Age |
SpendingCelebrating | double | Percent spending money on or celebrating Valentines Day |
Candy | double | Average percent spending on candy |
Flowers | double | Average percent spending on flowers |
Jewelry | double | Average percent spending on jewelry |
GreetingCards | double | Average percent spending on greeting cards |
EveningOut | double | Average percent spending on an evening out |
Clothing | double | Average percent spending on clothing |
GiftCards | double | Average percent spending on gift cards |
variable | class | description |
---|---|---|
Gender | character | Gender only including Men or Women |
SpendingCelebrating | double | Percent spending money on or celebrating Valentines Day |
Candy | double | Average percent spending on candy |
Flowers | double | Average percent spending on flowers |
Jewelry | double | Average percent spending on jewelry |
GreetingCards | double | Average percent spending on greeting cards |
EveningOut | double | Average percent spending on an evening out |
Clothing | double | Average percent spending on clothing |
GiftCards | double | Average percent spending on gift cards |
Data was downloaded from Sunja aa Kaggle dataset.
Data from historical_gift_trends_per_person_spending.csv, historical_spending_average_expected_spending.csv and historical_spending_percent_celebrating.csv were combined into historical_spending.csv.
Data from planned_gifts_age.csv and spending_or_celebrating_age_1.csv were combined into gifts_age.csv.
Data from planned_gifts_gender.csv and spending_or_celebrating_gender_1.csv were combined into gifts_gender.csv.
Percentage signs and dollar signs were removed from all numerical values.