Week 9. Mar.7: Multi-Modal Learning and Explainability - Possibilities #21
The paper AGENT AI: Surveying the Horizons of Multimodal Interaction explores the development of Agent AI, a class of interactive AI systems that integrate multimodal perception, human feedback, and embodied actions. The authors argue that Agent AI represents a pathway toward Artificial General Intelligence (AGI) by enabling models to process visual, linguistic, and environmental data in real-time interactions. Unlike traditional AI systems that operate in limited, predefined environments, Agent AI is designed to adapt dynamically across both physical and virtual spaces. The paper highlights key challenges, such as reducing AI hallucinations, mitigating biases, ensuring interpretability, and improving real-world integration. Additionally, it explores how large foundation models (LLMs and VLMs) can be leveraged for embodied AI systems in domains such as robotics, gaming, and healthcare.

The methodology introduced in this paper has profound implications for social science research, particularly in the study of human-computer interaction, digital behavior, and adaptive learning environments. Social science often relies on observational studies, surveys, and controlled experiments, which can be resource-intensive and limited by ethical concerns. By using Agent AI systems, researchers could simulate complex social behaviors and test hypotheses at scale. For example, AI-driven agents could model online discourse, decision-making in group dynamics, or responses to misinformation campaigns in a controlled setting. This would allow for replicable and dynamic studies of human-like behavior in diverse contexts.

To pilot such an application, I would propose using real-world multimodal interaction data from publicly available conversational datasets, such as Reddit threads, YouTube comment sections, or Twitter discussions, combined with gesture, speech, and facial expression data from video-based communication platforms like Zoom or Microsoft Teams. By integrating sentiment analysis, discourse modeling, and behavioral tracking, an Agent AI system could simulate how individuals respond to different social cues, misinformation, or emotionally charged interactions. Researchers could then modify environmental variables (e.g., introducing fact-checking interventions or varying social network structures) to study how different sociocultural factors shape online discourse and decision-making. This approach would provide scalable, ethical, and repeatable models for studying social interactions in digital spaces and beyond.
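As a minimal sketch of just the sentiment-scoring step of the pilot described above (the comment list is placeholder data; real input would be collected Reddit, YouTube, or Twitter threads, and the lexicon-based scorer is only a stand-in for a fuller discourse model):

```python
# Minimal sketch of the sentiment-scoring step of the proposed pilot.
# Assumes comments have already been collected (e.g., via platform APIs);
# the `comments` list below is illustrative placeholder data.
import nltk
from nltk.sentiment import SentimentIntensityAnalyzer

nltk.download("vader_lexicon", quiet=True)
analyzer = SentimentIntensityAnalyzer()

comments = [
    "This fact-check completely changed my mind.",
    "I can't believe people still share this nonsense.",
]

for text in comments:
    scores = analyzer.polarity_scores(text)  # neg/neu/pos/compound in [-1, 1]
    print(f"{scores['compound']:+.2f}  {text}")
```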
C. Choi, S. Yu, M. Kampffmeyer, A. -B. Salberg, N. O. Handegard and R. Jenssen, "DIB-X: Formulating Explainability Principles for a Self-Explainable Model Through Information Theoretic Learning," ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Seoul, Korea, Republic of, 2024, pp. 7170-7174, doi: 10.1109/ICASSP48485.2024.10447094.
The DIB-X model introduces a self-explainable deep learning approach that aligns with explainability principles using an information-theoretic learning framework. Unlike traditional post-hoc explainability methods that interpret deep models after training, DIB-X integrates explainability directly into the learning process. DIB-X employs Rényi’s α-order entropy functional to measure mutual information while avoiding assumptions about data distributions. It applies deep deterministic information bottleneck (DIB) learning to balance information retention and classification performance. The study validates this approach using datasets such as MNIST, marine monitoring images, and echosounder data, demonstrating both improved interpretability and accuracy compared to existing models like Grad-CAM and VIB-X.
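For readers unfamiliar with the matrix-based Rényi entropy functional that DIB-X builds on, here is a rough sketch of how mutual information can be estimated directly from Gram matrices without distributional assumptions (this is my own illustration, not the authors' code; the kernel width, α, and toy data are arbitrary):

```python
# Sketch of the matrix-based Rényi alpha-order entropy functional:
# S_alpha(A) = 1/(1-alpha) * log2(tr(A^alpha)), with A a trace-normalized
# Gram matrix of the data.
import numpy as np

def gram_matrix(X, sigma=1.0):
    """RBF Gram matrix, normalized so that trace(A) = 1."""
    sq_dists = np.sum((X[:, None, :] - X[None, :, :]) ** 2, axis=-1)
    K = np.exp(-sq_dists / (2 * sigma ** 2))
    return K / np.trace(K)

def renyi_entropy(A, alpha=2.0):
    """Matrix-based Rényi alpha-order entropy of a normalized Gram matrix."""
    eigvals = np.linalg.eigvalsh(A)
    eigvals = eigvals[eigvals > 1e-12]          # drop numerical zeros
    return (1.0 / (1.0 - alpha)) * np.log2(np.sum(eigvals ** alpha))

def joint_entropy(A, B, alpha=2.0):
    """Entropy of the normalized Hadamard product, for the joint distribution."""
    AB = A * B
    return renyi_entropy(AB / np.trace(AB), alpha)

# Mutual information estimate: I(X;Y) = S(A) + S(B) - S(A,B)
rng = np.random.default_rng(0)
X = rng.normal(size=(64, 10))
Y = X @ rng.normal(size=(10, 3))                # Y depends on X
A, B = gram_matrix(X), gram_matrix(Y)
mi = renyi_entropy(A) + renyi_entropy(B) - joint_entropy(A, B)
print(f"Estimated I(X;Y) ≈ {mi:.3f} bits")
```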
The self-explainability framework of DIB-X offers significant potential for social science research, particularly in policy analysis, media studies, and public opinion research. Current machine learning models in social sciences often struggle with interpretability, making it difficult to understand why a model arrives at certain conclusions. DIB-X, however, provides an inherently explainable way to analyze complex multimodal datasets.
We could apply this idea to image classification. For example, I would like to predict which style a painting belongs to. After training the neural network, DIB-X could show which parts of the painting are most related to a given style, letting us see how a style maps onto specific regions of the paintings. The pilot study could use established artwork datasets, predict the styles labeled by art critics, and then visualize the regions associated with each style.
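Since DIB-X's own code is not reproduced here, a post-hoc Grad-CAM pass over a style classifier gives a rough sense of what the visualization step could look like; the ResNet backbone, the number of style classes, and the image path below are all hypothetical stand-ins:

```python
# Rough sketch of the visualization step using post-hoc Grad-CAM as a stand-in
# (DIB-X builds the explanation into training instead). Assumes a ResNet-18
# fine-tuned on style labels; image path and class count are hypothetical.
import torch
import torch.nn.functional as F
from torchvision import models, transforms
from PIL import Image

model = models.resnet18(weights=None)                  # weights from fine-tuning
model.fc = torch.nn.Linear(model.fc.in_features, 10)   # e.g., 10 style classes
model.eval()

feats = {}
def save_activation(module, inputs, output):
    feats["act"] = output
    output.register_hook(lambda grad: feats.update(grad=grad))
model.layer4.register_forward_hook(save_activation)

preprocess = transforms.Compose([transforms.Resize((224, 224)), transforms.ToTensor()])
img = preprocess(Image.open("painting.jpg").convert("RGB")).unsqueeze(0)

logits = model(img)
style = logits.argmax(dim=1).item()
logits[0, style].backward()

# Grad-CAM: weight each feature map by its average gradient, then ReLU.
weights = feats["grad"].mean(dim=(2, 3), keepdim=True)
cam = F.relu((weights * feats["act"]).sum(dim=1))
cam = F.interpolate(cam.unsqueeze(1), size=(224, 224), mode="bilinear")
print(cam.shape)  # heatmap over the painting, highlighting style-relevant regions
```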
Reflection on “When Continue Learning Meets Multimodal Large Language Model: A Survey” (Huo & Tang, 2024)
https://www.nature.com/articles/s43588-023-00573-5

Summary: The methodology proposed in the article presents an exciting opportunity for extending social science research, particularly in the study of life course dynamics. Traditionally, social scientists have relied on surveys, longitudinal studies, and census data to analyze life trajectories. However, these methods often suffer from limitations such as recall bias and missing data. By embedding human life events into a structured vector space, researchers can analyze patterns with unprecedented granularity, allowing for predictive modeling of social mobility, economic inequality, and health outcomes.

Insight for Social Science:

Data for Pilot Implementation: In addition to medical records, smartphone sensor data could serve as a complementary source of information. With user consent, smartphone applications could collect passive data on movement patterns, screen time, social interactions (e.g., call and text frequency), and app usage, providing real-time insights into behavioral trends. By integrating these datasets, researchers could extract key life events and construct a multi-modal model of human life trajectories. By implementing these techniques, we could unlock new dimensions in social science research, offering more precise and dynamic insights into how life events shape individual and societal outcomes.
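A heavily simplified sketch of the life-event embedding idea (the article's actual model is a transformer over national registry data; here plain Word2Vec over invented toy event sequences stands in):

```python
# Simplified sketch of embedding life-event sequences in the spirit of the
# article's approach. All event codes below are invented for illustration.
from gensim.models import Word2Vec

# Each "sentence" is one person's chronologically ordered life events.
life_sequences = [
    ["finish_school", "first_job", "move_city", "promotion"],
    ["finish_school", "unemployed", "retraining", "first_job"],
    ["first_job", "move_city", "marriage", "child_birth"],
]

model = Word2Vec(sentences=life_sequences, vector_size=32, window=3,
                 min_count=1, epochs=200, seed=42)

# Events that occur in similar life contexts end up close in the vector space.
print(model.wv.most_similar("first_job", topn=3))
```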
Guilbeault et al. (2024) investigate how online images, particularly those from search engines and social media, amplify gender bias more than textual content. Using large-scale computational analysis of over one million images from Google, Wikipedia, and IMDb, alongside billions of words from these platforms, they find that gender bias is significantly stronger in images than in text. Their experimental results show that participants exposed to image-based search results develop stronger explicit and implicit gender biases about occupations. The study highlights how the shift toward visual content can exaggerate societal stereotypes, reinforcing pre-existing inequalities.

From a network learning perspective, this study provides a unique opportunity to explore how social biases propagate across digital platforms using autoencoders and network-based table learning. Specifically, autoencoders—which learn compressed representations of high-dimensional data—could be leveraged to model latent gender biases in multimodal datasets (text, images, and user interactions). Moreover, a graph-based learning approach could be used to study how biases diffuse across networks of users, content creators, and algorithmic recommendations on platforms like Google and Wikipedia.

To extend this approach in social science, we could apply autoencoders to de-bias image search algorithms by training models to distinguish bias-related image features (e.g., occupational stereotypes in images). Network-based learning could further help trace the evolution of gender associations across search algorithms over time, identifying how feedback loops reinforce bias. A pilot study could collect Google Image search results for various professions (e.g., “scientist,” “engineer,” “nurse”) across different geographic regions and compare their gender representations to labor force demographics (e.g., U.S. Census, Eurostat). An autoencoder trained on these images could learn a low-dimensional bias representation, quantifying the extent of stereotypical portrayals. Finally, a network-based approach could model user engagement patterns, examining how exposure to biased content affects subsequent searches and content recommendations. This application could inform algorithmic fairness interventions, helping researchers and policymakers design bias-mitigating strategies for search engines, recommendation systems, and AI-generated media.
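A minimal sketch of the proposed autoencoder step, assuming profession search images have already been collected and resized (the network shape and the random placeholder batch are illustrative only):

```python
# Minimal sketch: learn a low-dimensional representation of profession search
# images. Real input would be scraped Google Image results per profession.
import torch
import torch.nn as nn

class ImageAutoencoder(nn.Module):
    def __init__(self, latent_dim=16):
        super().__init__()
        self.encoder = nn.Sequential(
            nn.Conv2d(3, 16, 4, stride=2, padding=1),   # 64 -> 32
            nn.ReLU(),
            nn.Conv2d(16, 32, 4, stride=2, padding=1),  # 32 -> 16
            nn.ReLU(),
            nn.Flatten(),
            nn.Linear(32 * 16 * 16, latent_dim),
        )
        self.decoder = nn.Sequential(
            nn.Linear(latent_dim, 32 * 16 * 16),
            nn.Unflatten(1, (32, 16, 16)),
            nn.ConvTranspose2d(32, 16, 4, stride=2, padding=1),
            nn.ReLU(),
            nn.ConvTranspose2d(16, 3, 4, stride=2, padding=1),
            nn.Sigmoid(),
        )

    def forward(self, x):
        z = self.encoder(x)            # low-dimensional "bias representation"
        return self.decoder(z), z

model = ImageAutoencoder()
images = torch.rand(8, 3, 64, 64)      # placeholder batch of 64x64 search images
recon, latent = model(images)
loss = nn.functional.mse_loss(recon, images)
loss.backward()
print(latent.shape)                     # torch.Size([8, 16])
```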
I thought "A Unified Model of Human Semantic Knowledge" was a very interesting article -- it was able to combine brain imaging and patient data to define a theory of word-object relations that balanced theories that emphasize the locality of word-object relations to specific category domains (animals are grouped, tools are grouped -- perhaps in a similar way to word embeddings in vector space?) and theories that emphasize a general domain or approach to word-object relations. With a theory called C3 (connectivity-constrained cognition), the authors contend that word-object relations are crystallized in the cortex through learning/experience, perceptual/linguistic/somatic framing of environment, and neural connectivity in the brain. With this framework, the authors can provide a normative framework that structures disorders and pathologies that affect the mind, and a general theory of brain processing that maps neatly onto neural network patterns. This gets at the problem of both the multimodality of brain processing (learning/perceiving/classifying through images, speech, somatic experience etc) and also explainability -- it provides a theory of the brain and the brain's processing of word-object relations that informs how neural networks take inputted information and produce an output. Whether this theory of brain has been retrofitted to address neural network explainability conundrums, emerges as a way to assert consciousness as a reflection of neural network processing, or is coincidentally analogous to NNs is up for debate. But it is an interesting theory nonetheless that allows for. a better understanding of how NNs are a reflection of brain processes -- or, even more interesting, vice versa. |
The article “A Transformer Approach to Detect Depression in Social Media” by Keshu Malviya, Dr. Bholanath Roy, and Dr. Saritha SK discusses using deep learning approaches to detect early symptoms of depression, especially in the post-COVID era, when more people are online and feeling more isolated. The social media data are collected from “non-depressed” and “depressed” subreddits on the Reddit platform through the Pushshift API. The two categories do not reflect clinical diagnoses but are rather tonal: the authors define their depressed posts as “depressed in nature,” which is a relatively vague and unclear metric. The study uses TF-IDF and Word2Vec models as baselines and applies transformer models for comparison. The transformer models are generally more effective, with classification accuracy scores around 0.96-0.98.
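A small sketch of the TF-IDF baseline side of that comparison, on invented placeholder posts (the real study uses Reddit posts from the Pushshift API, and the fine-tuned transformer models would replace the logistic regression here):

```python
# Sketch of the TF-IDF + linear-classifier baseline on toy placeholder posts.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

posts = [
    "I can't get out of bed and nothing feels worth doing anymore",
    "Had a great hike this weekend, feeling refreshed",
    "Everything feels pointless and I keep isolating myself",
    "Excited to start my new job next week",
]
labels = [1, 0, 1, 0]   # 1 = "depressed in nature", 0 = non-depressed

baseline = make_pipeline(TfidfVectorizer(), LogisticRegression())
baseline.fit(posts, labels)
print(baseline.predict(["lately I just feel empty all the time"]))
```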
"Towards artificial general intelligence via a multimodal foundation model" integrates image and text data to propose BriVL(Bridging-Vision-and-Language), a multimodal foundation model. It uses large-scale weakly correlated image-text data. It's goal is to model human cognitive tasks, including vision, language, and cross-modal understanding. The model was trained on 650 million image-text pairs collected in the web with self-supervised learning. The distinctive characteristic of this model is that it utilizes weak semantic correlations that learn more abstract, generalized understanding. It also uses separate image and text encoders to embed images and text in a joint embedding space. The model could generate contextual and abstract representations of text descriptions and performed well on reasoning, classifying, and image-to-text and text-to-image retrieval tasks. I think this would be an excellent model for studying prejudice and the chain of thoughts, i.e., mental representations that people have, because it learns abstract representations. Also, if trained on corpus-image association from different cultures, it would also be great for comparison of culture. I think an interesting idea would be to train it on various fiction corpus, perhaps with book covers and text of the book, from different cultures. Then, we could ask it to generate an image from the same abstract sentences, like "imagine utopia", "this is the best day in spring". It would have learned what the representative of those abstract sentences would refer in each culture, although I'm not sure about how I could generalize results from each sentence, I think it would be an interesting thing to try and off-the-shelf to compare mental associations from different cultures. |
The article discusses the challenge of societal biases in multimodal AI, particularly how certain AI-generated outputs can reinforce stereotypes. What are some concrete strategies that developers can implement to detect and mitigate these biases during both the data collection and model training phases?
Re: Guilbeault, D., Nadler, E. O., Chu, M., Sardo, D. R. L., Kar, A. A., & Desikan, B. S. (2020). Color associations in abstract semantic domains. Cognition, 201, 104306.

This article explored an interesting question in embodied cognition: does sensory data (e.g., color) contribute to the semantic structure of abstract concepts? It tested three domains of abstract concepts: disciplines, emotions, and music genres. The authors applied a multi-modal learning method to project words and their Google Images into digital representations and examined their correlation and clustering. The results show that color variability increases with concept abstractness, and the color distributions of the words show clustering. This supports the hypothesis that color plays a role in the semantics of abstract concepts. A question I'd like to ask is: can we extend this conclusion to broader abstract concepts? Emotions and disciplines involve different cognitive processes, and the sensory association might be conveyed via emotion to other abstract concepts. Also, social norms could influence how search engine results represent an abstract concept (e.g., happy ~ yellow smiling face).

Extending social science analysis and possible data:
The paper "Neural Audio Synthesis of Musical Notes with WaveNet Autoencoders" presents a novel approach to audio synthesis by leveraging WaveNet-style autoencoders. Traditional audio synthesis methods often rely on predetermined algorithms or sample playback techniques, which can limit the expressiveness and realism of the generated sounds. This study addresses these limitations by introducing a neural network-based model that learns directly from raw audio data. A key innovation of this work is the development of a WaveNet-style autoencoder that conditions an autoregressive decoder on temporal codes learned from the raw audio waveform. This architecture enables the model to capture intricate temporal structures in audio signals, facilitating the generation of high-quality and realistic sounds. The authors also introduce NSynth, a large-scale dataset comprising over 300,000 musical notes from more than 1,000 instruments, which serves as a robust foundation for training the model. Through extensive experiments, the WaveNet autoencoder demonstrated superior performance over traditional spectral autoencoder baselines, both qualitatively and quantitatively. Notably, the model learns a manifold of embeddings that allows for morphing between instruments, meaningfully interpolating in timbre to create new types of sounds that are both realistic and expressive. In summary, this paper showcases the potential of advanced neural network architectures in revolutionizing audio synthesis. The integration of WaveNet autoencoders with a comprehensive dataset like NSynth paves the way for more natural and expressive sound generation, offering exciting possibilities for musicians, audio engineers, and the broader field of machine learning. Link to the paper:https://arxiv.org/abs/1704.01279 |
The paper Color Associations in Abstract Semantic Domains tests the theory of embodied cognition by quantitatively computing the relationships between concepts and the distribution of colors in their visual representations. The idea is that if the relations between concepts and those between their image representations are found to be coherent, then the human perception of these concepts can be claimed to have some degree of embodiment. The study uses sample concepts from different domains with different levels of abstractness. The corresponding images are collected from the first 100 images returned by a Google search for every single word. The images are clustered in colorspace and their color distributions computed. The authors also tested the color relations between words with hierarchical relationships in the semantic space. The results show that in the abstract domains, although not as strongly as with concrete words, the concepts are clustered by color at high statistical significance. Also, semantic similarity between words can predict the color similarity of their corresponding images. These findings support the theory of embodied cognition and show a new way to quantitatively study the embodiment of semantic relations.
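A rough sketch of the color-distribution comparison (the study clusters pixels in colorspace over Google Image results; here a fixed RGB quantization over hypothetical image paths stands in):

```python
# Sketch: histogram an image's pixels over a fixed color grid, then compare
# color distributions across concepts. Image paths are hypothetical.
import numpy as np
from PIL import Image
from scipy.spatial.distance import jensenshannon

def color_histogram(path, bins=4):
    """Pixel distribution over a fixed grid of bins**3 RGB colors."""
    pixels = np.asarray(Image.open(path).convert("RGB").resize((64, 64)))
    quantized = (pixels.astype(int) // (256 // bins)).reshape(-1, 3)
    idx = quantized[:, 0] * bins * bins + quantized[:, 1] * bins + quantized[:, 2]
    hist = np.bincount(idx, minlength=bins ** 3).astype(float)
    return hist / hist.sum()

h_joy = color_histogram("images/joy_001.jpg")
h_anger = color_histogram("images/anger_001.jpg")
# Smaller divergence = more similar color profiles for the two concepts.
print(jensenshannon(h_joy, h_anger))
```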
The article Seeing is Understanding addresses a fundamental issue in Multimodal Large Language Models (MLLMs)—vision-language misalignment, a phenomenon where textual responses do not factually match the provided visual inputs. The authors propose AKI, an MLLM enhanced with modality-mutual attention (MMA), allowing image tokens to incorporate information from text tokens. Without additional parameters or extended training time, MMA significantly improves model performance across 12 multimodal benchmarks, reducing inaccuracies like object hallucinations. This approach could notably enrich social science analyses, particularly in examining public perceptions and attitudes through multimodal social media data. A suitable pilot could use data from Instagram or Twitter to analyze how image posts (photos or memes) combined with text captions influence public opinions or stereotypes on critical social issues, such as immigration or gender roles. Researchers could collect images with associated user comments and hashtags from public posts discussing political events or social movements. By applying AKI's modality-mutual attention, the analysis would reveal how textual framing shapes visual perception among users, helping social scientists understand and predict patterns in public opinion formation and attitude polarization on online platforms.
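A simplified reading of the modality-mutual attention idea, not the AKI implementation: image tokens query the caption tokens through standard cross-attention, so the visual features are conditioned on the text (dimensions below are arbitrary):

```python
# Simplified sketch of cross-attention from image tokens to text tokens.
import torch
import torch.nn as nn

d_model = 64
image_tokens = torch.randn(1, 49, d_model)   # e.g., 7x7 patch tokens
text_tokens = torch.randn(1, 12, d_model)    # caption tokens

cross_attn = nn.MultiheadAttention(embed_dim=d_model, num_heads=4, batch_first=True)

# Image tokens query the text tokens; output image tokens now mix in caption info.
attended_image, attn_weights = cross_attn(
    query=image_tokens, key=text_tokens, value=text_tokens
)
print(attended_image.shape)   # torch.Size([1, 49, 64])
print(attn_weights.shape)     # torch.Size([1, 49, 12])
```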
This paper explores how Graph Neural Networks (GNNs) can enable multi-modal information fusion for explainable AI, particularly in complex domains like medicine. The authors introduce the concept of "causability" (distinct from causality) as the measurable extent to which an explanation achieves causal understanding for human experts. They propose a framework that uses GNNs to integrate diverse data types—images, text, genomics—into a unified representation space where causal links between features are directly encoded in graph structures. This approach allows for interactive "what-if" questions (counterfactuals) that help experts gain deeper insights into AI decision processes. The authors outline three core challenges: (1) constructing a multi-modal embedding space that bridges semantic gaps between different data types, (2) developing distributed graph representation learning techniques for decentralized data, and (3) creating explainable interfaces that enable meaningful human-AI interaction through counterfactual exploration.

This GNN-based multi-modal fusion approach could revolutionize social media analysis by integrating multiple data streams that reflect complex social phenomena. The counterfactual exploration capability would be particularly valuable for policy research, allowing analysts to pose "what-if" questions about intervention outcomes. For example, researchers could explore how changes in social network structure might affect the spread of misinformation when combined with specific content features.

I would implement this approach using a dataset from Twitter that captures multiple modalities related to political discourse during election periods. The dataset would include: (1) tweet text, (2) shared images, (3) user network connections, (4) engagement metrics, and (5) temporal patterns across different geographic regions. Implementation would begin by constructing modality-specific representations: text would be processed with language models, images with vision models, and network structures with traditional graph embeddings. These representations would then be connected through a knowledge graph serving as an "interaction & correspondence graph" as described in the paper.
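A sketch of how that fusion step might look under this reading (it assumes PyTorch Geometric is available; the tensors are random placeholders for real text, image, and network features, and the edge list is an invented stand-in for the interaction & correspondence graph):

```python
# Sketch: project each modality into a shared space, treat tweets/images/users
# as nodes in one graph, and learn joint representations with a GCN.
import torch
import torch.nn as nn
from torch_geometric.nn import GCNConv

text_feats = torch.randn(100, 768)    # e.g., language-model embeddings of tweets
image_feats = torch.randn(40, 512)    # e.g., vision-model embeddings of shared images
user_feats = torch.randn(60, 32)      # e.g., network/engagement features of users

# Modality-specific projections into a shared 128-d space.
proj_text, proj_image, proj_user = nn.Linear(768, 128), nn.Linear(512, 128), nn.Linear(32, 128)
x = torch.cat([proj_text(text_feats), proj_image(image_feats), proj_user(user_feats)])

# Placeholder edges (tweet-image, tweet-user, user-user, ...).
edge_index = torch.randint(0, x.size(0), (2, 500))

conv1, conv2 = GCNConv(128, 64), GCNConv(64, 64)
h = conv1(x, edge_index).relu()
h = conv2(h, edge_index)
print(h.shape)   # torch.Size([200, 64]) — one fused embedding per node
```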
Abhishek Mandal, Susan Leavy, and Suzanne Little. 2023. Measuring bias in multimodal models: Multimodal composite association score. In International Workshop on Algorithmic Bias in Search and Recommendation, pages 17–30. Springer.

Summary of the Article:

Extending to Social Science Analysis:

Implementation Idea: To analyze historical ableist bias trends, I propose comparing text-based bias evolution with multimodal bias persistence over time. Using historical newspapers (COHA, Chronicling America) and social media corpora, I will track disability-related language shifts with WEAT. For multimodal bias, I will analyze AI-generated images (DALL-E 2, Stable Diffusion) alongside historical visual media with text (ads, newspapers, magazines) using MCAS. This will reveal whether text bias declines over time while visual stereotypes persist, showing how ableism evolves across modalities.
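A small sketch of the WEAT effect size used for the text side of this plan (word vectors here are random placeholders; the real analysis would load embeddings trained on each historical corpus slice):

```python
# Sketch of the WEAT effect size for differential association of two target
# word sets (X, Y) with two attribute word sets (A, B).
import numpy as np

def cos(u, v):
    return u @ v / (np.linalg.norm(u) * np.linalg.norm(v))

def assoc(w, A, B):
    """s(w, A, B): mean cosine similarity to attribute set A minus to set B."""
    return np.mean([cos(w, a) for a in A]) - np.mean([cos(w, b) for b in B])

def weat_effect_size(X, Y, A, B):
    """Effect size d of the differential association of targets X vs. Y."""
    sx = [assoc(x, A, B) for x in X]
    sy = [assoc(y, A, B) for y in Y]
    return (np.mean(sx) - np.mean(sy)) / np.std(sx + sy, ddof=1)

rng = np.random.default_rng(0)
dim = 50
X = rng.normal(size=(5, dim))   # e.g., vectors for disability-related terms
Y = rng.normal(size=(5, dim))   # e.g., vectors for non-disability terms
A = rng.normal(size=(5, dim))   # e.g., vectors for one attribute set
B = rng.normal(size=(5, dim))   # e.g., vectors for the contrasting attribute set
print(weat_effect_size(X, Y, A, B))
```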
Post a link for a "possibility" reading of your own on the topic of Auto-encoders, Network & Table Learning [Week 9], accompanied by a 300-400 word reflection that: