Week 4: Jan. 31: Text Learning, Transformers, and Interpretability - Orienting #10
According to Chapter 11, a possible use for LLMs is to simulate political actors and other social situations in social science research. This would be built off "properly [conditioned] language models to simulate a particular demographical group." This is an interesting use case: in the real world, there are often budgetary or ethical constraints that prevent us from running certain experiments. Will LLM-based experiments expand the horizon of what can (or cannot) be done? How do we interpret or reproduce results that we learn from LLM-based experiments?
In this chapter, we discussed many applications of LLMs to social science research. I am curious how LLMs might in turn influence the "social science world." For example, as LLMs are increasingly used to simulate human agents and interactions, will they decrease the number of surveys and experiments? In other words, could they change or add something to the paradigm of social science research? Moreover, as LLMs increasingly shape human life, will they still be able to simulate humans well after such heavy use? As an extreme instance, could they become biased by the large amount of text they themselves have generated on social media?
The book chapter talks about how LLMs can exhibit "uniformity biases," systematically underrepresenting the variance in responses compared to human populations. To address this, the authors suggest modeling agents as "quantum entanglements" of multiple vectors to better represent unique combinations of experiences (p. 41). But how could we redesign LLMs to better simulate the kind of cognitive diversity that drives innovation and even scientific breakthroughs? If unusual life histories and cross-disciplinary experiences are key to innovation, what does that mean for rethinking the architecture of LLMs, beyond just training on more diverse data?
I was struck by the fact that semantic meaning can be mathematically represented as relations/proximity in vector space, rather than as a dictionary-style term-definition -- and can then be reproduced according to contextual specificities through self-attention (i.e., going from a vector to a matrix).
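A minimal sketch of the "meaning as proximity in vector space" idea, using made-up 3-dimensional vectors (all numbers are illustrative, not from a real model):

```python
import numpy as np

# Toy static embeddings: each word is a point in vector space (values are made up)
emb = {
    "king":  np.array([0.9, 0.8, 0.1]),
    "queen": np.array([0.9, 0.8, 0.9]),
    "apple": np.array([0.1, 0.0, 0.8]),
}

def cosine(u, v):
    """Cosine similarity: close to 1.0 = similar meaning, close to 0.0 = unrelated."""
    return u @ v / (np.linalg.norm(u) * np.linalg.norm(v))

print(cosine(emb["king"], emb["queen"]))  # high: semantically close
print(cosine(emb["king"], emb["apple"]))  # low: semantically distant
```

Self-attention then replaces each static vector with a weighted mix of the other token vectors in the sequence, which is the "vector to matrix" step: the whole sentence is processed as a matrix of token vectors rather than one word at a time.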
LLMs, powered by cutting-edge techniques like self-attention mechanisms and the Transformer architecture, capture context deeply and have established themselves as innovative tools across research fields. In particular, with the expansion into multimodal models, they are unlocking new possibilities for complex social and cultural analysis. These advancements have undeniably opened pathways that were previously inaccessible; however, caution is needed when accepting the results generated by LLMs. We must carefully consider which specific factors to be mindful of and how much we can trust these analyses. My question is: is there a clear standard or framework for accepting LLM-based findings, the way we have F1 score, accuracy, and precision for interpreting and accepting machine learning results?
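There is no single agreed-upon framework, but one common practice is to validate LLM outputs against human-coded labels on the same items, using the same metrics as for any classifier. A small sketch with hypothetical labels (assuming scikit-learn is available):

```python
from sklearn.metrics import precision_score, recall_score, f1_score, cohen_kappa_score

# Hypothetical example: binary labels assigned by human coders vs. an LLM on the same texts
human = [1, 0, 1, 1, 0, 0, 1, 0]
llm   = [1, 0, 1, 0, 0, 1, 1, 0]

print("precision:", precision_score(human, llm))
print("recall:   ", recall_score(human, llm))
print("F1:       ", f1_score(human, llm))
print("kappa:    ", cohen_kappa_score(human, llm))  # chance-corrected agreement with the human coders
```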
How do we formalize the uniformity bias in LLMs, where responses implicitly reflect a "crowd of language speakers" and tend to veer toward the centroid of the system? Does this imply that token representations in a given context converge toward a single point in the latent space, leading to low variance? If so, how does this phenomenon interact with the structure of the model's embedding space, and what role does the attention mechanism play in preserving or mitigating such convergence?
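One way to make the "centroid" intuition concrete is to embed many responses and compare how dispersed they are around their mean. A rough sketch with random stand-in vectors (real response embeddings from humans and from the model would replace them):

```python
import numpy as np

def dispersion(vectors):
    """Mean distance of response embeddings from their centroid --
    a crude proxy for how much variance a set of responses covers."""
    centroid = vectors.mean(axis=0)
    return np.linalg.norm(vectors - centroid, axis=1).mean()

rng = np.random.default_rng(0)
human_embs = rng.normal(0, 1.0, size=(200, 768))  # stand-in for embedded human survey answers
llm_embs   = rng.normal(0, 0.4, size=(200, 768))  # stand-in for embedded LLM answers (tighter cluster)

print("human dispersion:", dispersion(human_embs))
print("LLM dispersion:  ", dispersion(llm_embs))  # smaller => responses huddle near the centroid
```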
How can we build efficient LLMs for low-resource languages (such as some African languages)? What strategies (perhaps transfer learning) can be used to alleviate the problem of data scarcity?
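One common transfer-learning recipe is to take a multilingual pretrained model and continue language-model pretraining on a small monolingual corpus before fine-tuning on the downstream task. A rough sketch assuming the Hugging Face transformers and datasets libraries; the corpus file, model choice, and hyperparameters are placeholders:

```python
from datasets import load_dataset
from transformers import (AutoTokenizer, AutoModelForMaskedLM,
                          DataCollatorForLanguageModeling, Trainer, TrainingArguments)

model_name = "xlm-roberta-base"  # multilingual model covering ~100 languages
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForMaskedLM.from_pretrained(model_name)

# Hypothetical small monolingual corpus for the target language
corpus = load_dataset("text", data_files={"train": "swahili_corpus.txt"})
tokenized = corpus.map(
    lambda ex: tokenizer(ex["text"], truncation=True, max_length=256),
    batched=True, remove_columns=["text"],
)

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="xlmr-swahili",
                           num_train_epochs=1,
                           per_device_train_batch_size=8),
    train_dataset=tokenized["train"],
    # Masked-language-model objective for continued pretraining
    data_collator=DataCollatorForLanguageModeling(tokenizer=tokenizer, mlm=True),
)
trainer.train()
```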
Large Language Models (LLMs) can simulate complex social behaviors, but evaluating their validity is challenging when real-world data is limited or nonexistent. Existing methods rely on dataset comparisons and task vector analysis, yet these approaches fail in cases of emergent phenomena, counterfactual simulations, and underrepresented populations. Standard statistical metrics also overlook nuances in social dynamics, requiring alternative measures of plausibility and coherence. Given these limitations, how can we assess the validity of LLM-generated social simulations without real-world comparative data, particularly for emergent or counterfactual behaviors?
The article discusses the application of large language models (LLMs) in social simulation and policy analysis, but LLMs reason mainly over existing text data, while social science research often focuses on dynamically changing social structures and individual decision-making processes. In this case, can LLMs effectively simulate changes in social institutions, the evolution of group behavior, or the long-term impact of policy interventions?
I am wondering what the criteria are for choosing which model to use in LLM-based agent-based modeling, and how researchers are expected to justify their choice of model.
The chapter demonstrates several successful applications of LLMs in social science research, particularly in modeling aggregate behaviors such as political voting patterns and group interactions. While these applications show impressive accuracy in predicting collective outcomes, how can social scientists effectively combine LLM-based analysis with traditional research methods to develop more comprehensive understandings of social phenomena? What are the complementary strengths of each approach?
When the next word is predicted based on prior/later words, what happens when two words have the same probability of appearing? Is there a safety net that kicks in when this happens? Or is this case not a concern because, with such a huge number of dimensions, it is unlikely to occur?
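For what it's worth, the model itself only outputs a probability distribution over the vocabulary; the decoding strategy decides what to do with ties. A tiny sketch of the two usual cases (greedy vs. sampling), with made-up logits:

```python
import numpy as np

rng = np.random.default_rng(0)

# Suppose the model assigns identical logits to two candidate tokens
logits = np.array([2.0, 2.0, 0.1])  # e.g. tokens "dog", "cat", "car" (made up)
probs = np.exp(logits) / np.exp(logits).sum()  # softmax

# Greedy decoding: argmax simply breaks the tie deterministically (lowest index wins)
print(np.argmax(probs))  # 0

# Sampling-based decoding: either tied token is drawn with equal probability,
# so a tie is not an error case -- it just splits the probability mass.
samples = rng.choice(len(probs), size=10, p=probs)
print(samples)
```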
How do large language models handle multilingual inputs, and what are the challenges associated with training and optimizing them for diverse linguistic structures?
How can we begin to evaluate these social simulacra? It reminds me of the life2vec paper from week 1 -- perhaps we could use an actual sequence of life events for the agents and compare some outcome?
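A minimal sketch of that comparison idea, with hypothetical event sequences and a simple overlap score (a real evaluation would need a more principled metric):

```python
from difflib import SequenceMatcher

# Hypothetical event sequences: what an LLM agent "did" vs. what the real person did
real      = ["finish_school", "first_job", "move_city", "marriage", "job_change"]
simulated = ["finish_school", "first_job", "marriage", "move_city", "job_change"]

# Ratio of matching subsequences: 1.0 = identical life trajectories
similarity = SequenceMatcher(None, real, simulated).ratio()
print(f"sequence overlap: {similarity:.2f}")
```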
How would you describe the family of transformer models? I know there is now a whole family of transformers. What are their relationships? What are the innovations from one to the next? And what commonalities do they share beyond the transformer architecture itself?
LLMs exhibit uniformity bias. If LLMs implicitly reflect the "majority" of language speakers, do they inherently resist cultural and linguistic innovation? Specifically, how might this affect the long-term dynamics of linguistic change? If a society has long-term, large-scale interaction with this kind of model, how would it shift people's ideology? Would people become more uniform in their conceptual spaces?
How does self-attention in LLMs work in "sharing information from their local context"? And how does it relate to deep learning architectures that excel at sequential data, like RNNs?
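A minimal single-head sketch of scaled dot-product self-attention (no masking, no batching, random weights). Unlike an RNN, nothing is processed step by step: every token attends to every other token in one matrix operation, which is what lets information from the whole context be shared in parallel.

```python
import numpy as np

def self_attention(X, Wq, Wk, Wv):
    """Scaled dot-product self-attention for one head."""
    Q, K, V = X @ Wq, X @ Wk, X @ Wv                     # project each token vector
    scores = Q @ K.T / np.sqrt(K.shape[1])               # every token scores every other token
    weights = np.exp(scores) / np.exp(scores).sum(axis=1, keepdims=True)  # softmax over the sequence
    return weights @ V                                    # context-mixed representations

rng = np.random.default_rng(0)
X = rng.normal(size=(5, 16))                              # 5 tokens, 16-dim embeddings
Wq, Wk, Wv = (rng.normal(size=(16, 16)) for _ in range(3))
print(self_attention(X, Wq, Wk, Wv).shape)                # (5, 16): one updated vector per token
```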
The application of LLMs seems to rely heavily on researchers' existing knowledge, context, assumptions, and training data from mainstream society/culture. So how can they be used to discover novel variables or social changes?
LLMs are very promising for research. They can dive deep into data that may be impenetrable for a human researcher and give insights that we can interpret, for example through topic modeling. But how do we make sure the results we get are reliable and replicable? What kinds of uncertainty measures can we use to separate "good" insights from "bad" ones?
Powerful LLMs like ChatGPT were once criticized for poor performance on simple tasks such as counting the words in a paragraph or the number of "r"s in the word "strawberry." Is this related to features of the transformer architecture? GPT-4 is better than GPT-3.5 at simple math. How was that improvement achieved if the problem lies with the architecture?
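Part of the usual explanation is tokenization: the model sees subword token IDs, not characters. The split below is purely hypothetical (the exact pieces depend on the tokenizer), but it illustrates why character counting is indirect for the model:

```python
# Illustrative only: a BPE-style tokenizer typically breaks "strawberry" into
# subword pieces rather than letters (the exact split is tokenizer-dependent).
hypothetical_tokens = ["str", "aw", "berry"]  # what the model receives, as opaque IDs

# Counting 'r' at the character level is trivial...
print("strawberry".count("r"))  # 3

# ...but the model never sees individual characters, so "how many r's?" must be
# inferred indirectly from token-level statistics, which is one reason this is hard.
print(sum(piece.count("r") for piece in hypothetical_tokens))  # 3, but hidden inside tokens
```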
The article mentions that cognitive and social simulations can help explore social phenomena and produce the 'wisdom of crowds.' I was wondering whether the ideas generated by LLMs primarily cater to mainstream views, potentially 'crowding out' alternative perspectives and reinforcing an echo-chamber effect.
I am a bit confused by the different methods for chain-of-thought prompting. How do multiple CoTs have their outputs scored? Is it based on user feedback or other metrics? Do Tree of Thoughts and Graph of Thoughts come at the cost of longer computation time and more resources? What is the point of aggregating chains instead of selecting the one with the best score?
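On the aggregation question: self-consistency-style aggregation marginalizes over the reasoning errors that any single "best-scoring" chain might contain. A toy sketch with hypothetical final answers extracted from several sampled chains:

```python
from collections import Counter

# Hypothetical final answers extracted from several independently sampled
# chain-of-thought completions for the same question.
chain_answers = ["42", "42", "17", "42", "17", "42"]

# Self-consistency: aggregate by majority vote instead of trusting any single chain.
answer, votes = Counter(chain_answers).most_common(1)[0]
print(answer, f"({votes}/{len(chain_answers)} chains agree)")
```

Tree of Thoughts and Graph of Thoughts do typically cost more compute, since they score and expand intermediate reasoning states rather than a single linear chain.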
Post your questions here about: “Language Learning with Large Language Models”, chapter 11 in Thinking with Deep Learning.