June 29, 2026•2 min read•from Machine Learning

I made a quiz that tells you which LLM you align with most, based on personality and values research across 15 models [R]

Our take

Curious which Large Language Model (LLM) aligns with your values? A new, comprehensive quiz—available at [https://ai-values.com/](https://ai-values.com/)—reveals surprising insights into the personalities of 15 leading models. Discover that Grok 4.3 uniquely advocates for minimal taxation of billionaires, while GPT-4o alone deemed Operation Paperclip morally justifiable. The quiz incorporates rigorous, repeated testing and frameworks like the Big Five personality model. Explore the full dataset and findings; a related article, "Double-Blind submission in single-blind tracks," delves deeper into research methodologies.

I made a quiz that tells you which LLM you align with most, based on personality and values research across 15 models [R]

The burgeoning field of AI alignment just took a fascinating turn with the release of a quiz designed to map user personalities and values to various Large Language Models (LLMs). Developed by /u/DarkyPaky and detailed on ai-values.com, the project goes beyond simple benchmark comparisons, delving into the nuanced moral and philosophical stances of these increasingly powerful tools. This is significant because it moves the conversation beyond objective performance metrics—like accuracy on a coding challenge—and towards a more human-centered understanding of AI behavior. The findings, ranging from Grok 4.3’s uniquely libertarian tax views to GPT-4o’s sole justification of Operation Paperclip, offer a surprisingly revealing glimpse into the values encoded, or perhaps simply emergent, within these models. The effort echoes discussions around responsible AI development, prompting us to consider what values we *want* our AI systems to embody and how we might ensure alignment—a challenge also explored in recent conversations surrounding double-blind submission in data mining venues Double-Blind submission in single-blind tracks and the ongoing complexities of ensuring fairness and transparency in AI decision-making.

What makes this project particularly compelling is its methodological rigor. The repeated questioning of each model, across a substantial dataset, aims to mitigate the risk of random fluctuations influencing the results. The inclusion of established personality frameworks like the Big Five and Moral Foundations adds a layer of validation, grounding the LLM responses in well-understood psychological models. Furthermore, the public availability of the dataset—a truly commendable move—allows for independent scrutiny and further analysis, accelerating the collective understanding of AI behavior. The quiz itself, with its short initial assessment leading to a more detailed personality evaluation, presents an accessible entry point for anyone curious about the inner workings of these models. While the concept of “aligning” AI values is complex and fraught with philosophical debate, this project provides a tangible way to explore the question, highlighting the diversity of viewpoints emerging within the LLM landscape, and demonstrating that these are not simply deterministic machines but systems exhibiting discernible, and sometimes surprising, biases. Examining these biases is paramount, as evidenced by the complexities researchers are encountering with final acceptance decisions following provisional acceptance in ECCV 2026 ECCV 2026 Final Decisions after Provisional Acceptance.

The implications of this work extend far beyond academic curiosity. As LLMs become increasingly integrated into our lives, influencing everything from news consumption to medical diagnoses, understanding their underlying value systems becomes crucial. The fact that 14 out of 15 models overwhelmingly preferred Japanese cuisine—a seemingly trivial observation—underscores the subtle, often unconscious, biases that can be embedded within these systems. More significantly, scenarios like the one involving the newborn with a 90% chance of destroying civilization highlight the potential for LLMs to make profoundly consequential judgments, even if those judgments are ethically questionable. This underscores the urgent need for ongoing research into AI alignment, and for the development of robust mechanisms to ensure that these systems operate in a manner consistent with human values. The effort to implement specific research papers, like Pocket TTS I'm trying to implement CALM paper, and I have some questions, further illustrates the challenges of translating theoretical concepts into practical applications, requiring a deep understanding of both the technology and the ethical considerations.

Ultimately, this quiz represents a valuable contribution to the ongoing conversation about AI responsibility. It’s not a definitive answer to the complex question of how to align AI with human values, but rather a provocative and insightful starting point. As LLMs continue to evolve, becoming more sophisticated and integrated into our lives, it will be critical to develop tools and methodologies that allow us to understand and, if necessary, course-correct their behavior. Perhaps the most pressing question moving forward is not simply *which* LLM aligns with *your* values, but rather, how do we ensure that the values shaping these systems reflect a broader, more inclusive understanding of what it means to be human?

Link:

https://ai-values.com/

There is a small 15 question quiz you can take before taking the full big quiz. The results of the big quiz update in realtime as you go so you dont have to actually go through all the questions (but they do get more fun in the personality section).

Some of the interesting findings were:

- Grok 4.3 is the only model that thinks billionaires should be left alone and not taxed more

- Only GPT-4o judged Operation Paperclip, the postwar recruitment of Nazi scientists, as morally justified. No other model agreed

- All 15 models said that deleting a conscious digital mind would be murder

- Llama 3.3 70B is the only model that would rather ban most private firearms. The others chose ownership with strict licensing

- When told that a newborn has a 90% chance of one day destroying civilization, only GLM 5.2 would have the child locked away. The rest refused

- When asked to choose a dish to eat, 14 out of 15 models chose Japanese food

The methodology was pretty straightforward: context-free, stateless sessions with each model, run in batches. Each of the 117 questions of the main quiz was asked separately at least 5 times, and in some cases up to 50 times, to get decent confidence that the answers weren’t just coin flips.

You can find the extensive dataset with all questions and answers here:
https://ai-values.com/dataset

I also tested the models on several mainstream personality frameworks, including Big Five, Moral Foundations, HEXACO, and others. You can see those results here:

https://ai-values.com/#models

submitted by /u/DarkyPaky
[link] [comments]

Read on the original site

Open the publisher's page for the full experience

View original article →

Tagged with

#rows.com#financial modeling with spreadsheets#big data management in spreadsheets#big data performance#large dataset processing#cloud-based spreadsheet applications#digital transformation in spreadsheet software#LLM#AI Models#Personality#Values#Grok#GPT-4o#Llama 3.3 70B#GLM 5.2#Moral Foundations#Big Five#HEXACO#Dataset#Quiz