Karina Nguyen

Karina Nguyen leads research at OpenAI, where she’s been pivotal in developing groundbreaking products like Canvas, Tasks, and the o1 language model. Before OpenAI, Karina was at Anthropic, where she led post-training and evaluation work for Claude 3 models, created a document upload feature with 100,000 context windows, and contributed to numerous other innovations. With experience as an engineer at the New York Times and as a designer at Dropbox and Square, Karina has a rare firsthand perspective on the cutting edge of AI and large language models.

5 skills 12 insights

AI & Technology Skills

Creative reasoning, aesthetics, and high-level idea filtering remain difficult to automate and are high-value skills for AI product teams.

"Creative thinking and you kind of want to generate a bunch of ideas and filter through them and not just build the best product experience. I think it's actually really, really hard to teach the model..."
00:26

Product value in AI often comes from the form factor (like file uploads) rather than just the underlying model capability.

"Because file uploads... It's like form follows function. It's like the form factor, the file uploads can enable people to just literally upload anything, the books, any reports, financial and ask any..."
37:04

Effective AI strategy involves designing product experiences that anticipate future model improvements rather than just current limitations.

"You want to build for the future. So it's like it doesn't necessarily matter whether the model is good or not, good right now, but you can build product ideas such that by the time the models will be..."
43:53

AI is exceptionally strong at synthesizing disparate data sources (feedback, metrics, logs) into a cohesive strategy or plan.

"I think what models are really good at is connecting the dots, I think. It's like if you have user feedback from this source, but you also have an internal dashboard with metrics and then you have oth..."
50:10

Model training requires a focus on high-quality data and a debugging mindset similar to traditional software engineering.

"Model training is more an art than a science. And in a lot of ways we, as model trainers, think a lot about data quality. It's one of the most important things in model training is like how do you ens..."
06:36

Synthetic data allows for rapid iteration of specific product behaviors without the bottleneck of human data collection.

"I think to me synthetic data training is more for product... It's a rapid model iteration for similar product outcomes. And we can dive more into it, but the way we made Canvas and tasks and new produ..."
11:39

Deep engagement with model prompting is a primary source of insight for improving model behavior and personality.

"people spend so much time prompting models and where quality's a really bad batch all the time, and you actually get a lot of new ideas of how do you make the model better? It's like, "This response i..."
18:44

Robust evaluations (evals) are the primary way to measure model progress and ensure new training doesn't 'brain damage' existing intelligence.

"You definitely want to measure progress of your model and this is where evals is, is because you can have prompted model as a baseline already. And the most robust evals is the one where prompted base..."
23:22

Prompting has replaced traditional wireframing as the primary method for prototyping AI-driven user experiences.

"prompting is a new way of product development or prototyping for designers and for product managers."
24:35

Career Skills

The rapid advancement of AI in technical domains like coding can necessitate a career pivot from execution-focused roles to research-focused roles.

"When I first came to Anthropic and I was like, "Oh my God, I really love front-end engineering." And then the reason why I switched to research is because I realized, "Oh my God, Claude is getting bet..."
00:06

Hiring & Teams Skills

AI team cultures vary between a focus on 'craft and personality' (Anthropic) and 'innovation and risk-taking' (OpenAI).

"I would say what I've learned from Anthropic is this real care and craft towards model behavior, model craft, model training... OpenAI's much more innovative and much more risk-takers in terms of prod..."
53:48

Product Management Skills

In AI research, prioritization is driven by the scarcity of compute and the need for high conviction in specific experimental paths.

"AI research progress is bottlenecked by management, research management. It's because you have constrained set of compute and you need to allocate the compute to the research paths that you feel the m..."
46:28