Skip to content

Project in collaboration with the Institut DATAIA on youth privacy protection and online gaming.

Notifications You must be signed in to change notification settings

kchardon/YPPOG_project_M1

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

35 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Youth Privacy Protection and Online Gaming (YPPOG)

This research project was initiated during my Master 1 in Data & AI at the Institut Polytechnique de Paris in collaboration with Benjamin NGUYEN and Nicolas SOULIE at the Institut DATAIA. I am continuing to develop this project during my Master 2 studies and into my PhD.

Project Overview

The YPPOG project explores whether private data (e.g., age, gender, and region of residence) can be inferred from game data, with a primary focus on determining if a player is a minor. This research is critical for enhancing youth privacy protection in online gaming environments.

Data

Data Selection

An initial list of multiplayer games was compiled, focusing on those with accessible APIs for data retrieval. After testing multiple APIs, we selected the following:

  • Riot API: for League of Legends, Legends of Runeterra, and Teamfight Tactics.
  • Steam API
  • PlayerUnknown's Battlegrounds API

We currently focus on League of Legends due to its global player base, significant popularity, and extensive youth participation.

Data Collection

To gather data, we collaborated with player privacy protection organizations and university gaming associations. Participants completed a consent form and a survey, allowing us to collect demographic information and game data. Python scripts were developed to facilitate data cleaning and collection, which can be found within each game’s folder.

The main functions are:

  • cleanTarget(data : pd.DataFrame) : Available in targetCleaner.py, this function cleans demographic data collected from participants. It takes a pandas DataFrame as input and outputs a cleaned DataFrame with target variables and game identifiers.

  • getData(targets : pd.DataFrame) : Located in getDataMultiplePlayers.py, this function collects and cleans in-game data for each participant. It takes a DataFrame of target data and returns a DataFrame with game feature data.

Example usage of these functions is provided in getData.ipynb within each game folder. Currently, we are collecting data from all matches played since 2022.

Prediction Approaches

Our prediction task is structured around two primary methodologies:

  • Unsupervised Learning: Clustering the data to identify patterns that may correlate with demographic information without a predefined model.
  • Supervised Learning: Developing predictive models to infer demographic attributes, ideally with limited data, simulating a privacy attacker scenario.

Hypotheses for Feature Selection

Some initial hypotheses for identifying minors include:

  • Playing patterns: Minors may play mostly in the evenings or on weekends.
  • Session length: Minors may have shorter sessions due to parental controls.

Other demographic characteristics might also be inferred from gaming behavior:

  • Region of residence: Could potentially be inferred based on regional vacation times.
  • Employment status: Those unemployed may play more often during daytime hours.

Requirements

To utilize the functions in this project, install the required libraries listed in requirements.txt:

pip install -r requirements.txt

Note: API keys are required for data retrieval from each of the gaming APIs.

Future work

Several key areas remain for further development:

  • Updating the target cleaner function to align with the finalized online consent form.
  • Conducting in-depth analysis to examine correlations between game data and demographic attributes.
  • Collaborating with experts in each game to identify relevant in-game data that could contribute to privacy-related predictions.
  • Collect data.
  • Extending data collection functions to games beyond League of Legends.

About

Project in collaboration with the Institut DATAIA on youth privacy protection and online gaming.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published