First, repeatedly

Across more than 190 projects in three Kenyon courses — Programming Humanity, AI for Humanity, and the Senior Seminar — one pattern repeats: a new capability ships, and the students Elkins and Jon Chun mentor are auditing it, fine-tuning it, or building with it the same semester. The research community arrives at the same questions one to three years later.

In June 2026, The New Yorker’s “Eight Predictions for the Future of Higher Education” named frontier-lab, project-based research as one of the forms higher education will take by 2035. Elkins and Chun have run that model since 2016, when they founded the human-centered AI curriculum and the AI CoLab at Kenyon College — giving humanities and social-science students the kind of research lab STEM students have always had.

Done at the frontier, as it moved

Selected projects, anchored to what existed — and what didn’t — when students did the work. Every name links to the full paper on Digital Kenyon.

  • 2018
    Neural networks in a humanities classroom — before GPT-2 existedThe first AI for Humanity cohort trained recurrent networks to generate sheet music (Seth Colbert-Pollack), built convolutional style-transfer systems (Miles Shebar), applied deep reinforcement learning to trading (Tucker Bennett et al.), and ran NLP across thirty years of Ohio college newspapers (Shane Canfield) — the year before GPT-2 was announced.
  • 2019
    Fine-tuning GPT-2 within months of its staged releaseOpenAI released the 345M GPT-2 model in May 2019 and withheld the full model until November. That same year, students were already fine-tuning 345M for creative work: Alasia Destine-DeFreece and collaborators asked whether GPT-2 could replace a Sex and the City writers’ room, and Olivia Kane framed “centaur screenwriting.” Others worked with StyleGANs (Sophie Dodd) and LSTM-generated 3D sculpture (Nick Downey).
  • 2020
    The Gwern-cited cohort — and GPT-3 the year of its APIThree student papers on GPT-2 poetry and lyric generation — Jonah Zitelli on James Wright, Kaiya Case on John Donne, and Sophie Barrio on hit song lyrics — were cited and archived by Gwern Branwen, whose site remains a standing reference for early GPT-2 creative experimentation. GPT-3’s API opened in June 2020; by fall, Emmy Roday was studying GPT-3’s reimagining of Ginsberg’s “Howl,” and Rebecca Lawson used stylometry on AI-generated Nancy Drew, anticipating AI-text detection.
  • 2021
    Systematic fine-tuning experimentsRebecca Lawson ran a hyperparameter study — temperature, epochs, corpus size — on GPT-2 screenplay generation, the kind of ablation then rare outside ML labs. Grant Holt asked whether a language model can write history; Teddy Kamin fine-tuned on Mike Royko’s journalism; Sarah Groustra trained GPT-2 on The Rocky Horror Picture Show to ask whether camp is computable.
  • 2022
    Red-teaming ChatGPT within weeks of launchChatGPT launched November 30, 2022. Before the semester ended, Adam Blum had completed “Breaking ChatGPT with Dangerous Questions,” a systematic study of how the model prioritizes safety, context, and obedience — among the earliest jailbreak research anywhere. The same year, Fredrika Pfeiffer compared GPT-2 and GPT-3 on Das Kapital, Simon Hua treated prompt engineering as a research object before it had a literature, and Raya Kenney audited bias in image generators.
  • 2023
    Auditing GPT-4 the spring it shippedGPT-4 arrived in March 2023; students audited it immediately. Abigail Foster tested whether GPT-4 could fool TurnItIn’s detector. Annalia Fiore evaluated GPT-4’s consistency in assessing — and self-assessing — literary texts, anticipating “LLM-as-judge.” Rishil Kondapaneni benchmarked ChatGPT against Google Bard; Dillon Cleary asked whether an LLM could stand in for an executive on an earnings call.
  • 2024
    Studying real human-AI interaction at scaleHannah Sussman topic-modeled 3,275 real ChatGPT conversations as the first large conversation datasets became available. Richard Anthony Álvarez built a retrieval-augmented film recommendation system; Nava Bahrampour paired sentiment analysis of autism-diagnosis narratives with an audit of AI attitudes toward disability language.
  • 2025
    Multi-agent systems, benchmarks, and domain fine-tunesGodwin Idowu tracked emotion, persuasion, and deception across multi-agent negotiations with valence-arousal-dominance modeling, and built a contamination-resistant math benchmark. Juliette Lowe fine-tuned BioLinkBERT for pediatric rheumatology. Marisol Hernandez Brito turned the program’s “shapes of stories” methods on AI’s own outputs; Parker Gibbons used multi-agent debate to automate MLB front-office decisions.
  • 2026
    AI agent networksMuhammad Ibraheem Nadeem applied network analysis to Moltbook, an AI agent social network, identifying missing memory as a structural problem. Ayesha Aslam combined aspect-based sentiment analysis and behavioral analytics to study how AI influencers capture attention.

More than 190 projects, 2018–2026

Across three courses. Every title links to the full paper on Digital Kenyon; the selection below is representative. Projects flagged first were conducted in the same year the underlying capability shipped — or before the research area had a name.

Generative AI & Creative Writing

Fine-tuning and probing language models for poetry, screenwriting, lyrics, comedy, and journalism — beginning the year GPT-2 shipped.

Sentiment Analysis & the Shapes of Stories

The program's signature method: computational analysis of narrative and emotional arc, applied to literature, translation, memoir, and film.

Social Media & Political Discourse

Computational social science on live political events — elections, protests, and public opinion as they unfolded.

AI Art, Music & Multimodal Creativity

GANs, style transfer, generative music, and computer vision for visual culture — beginning in 2018.

Human-AI Interaction & AI for Learning

How people actually use AI — studied empirically — and AI designed around human needs.

Agentic & Multi-Agent AI

Emotion, deception, debate, and network structure in systems of interacting AI agents.

Applied AI: Finance, Business & Prediction

Machine learning and LLMs for markets, forecasting, and organizational decision-making.

Capstone Builds: Products & Platforms

Senior Seminar projects that shipped: working platforms, tools, and systems.

Read, cited, and built on

107,766
downloads of student research
198
countries
4,700+
institutions, from Stanford to Microsoft

Gwern Branwen cited three of these papers on his GPT-2 poetry page and preserved them in his archive. Meta AI researchers cited Adam Blum’s ChatGPT red-teaming study — a class paper written within weeks of launch. Forbes featured Parker Gibbons’ AI scouting project, and profiled the program in “Where AI Meets the Humanities.” Nine of these projects appear in the works cited of “Beyond Plot”, and The Shapes of Stories (Cambridge UP, 2022) credits Erin Shaheen with proposing one of its central inquiries.

For the keynotes and forums where this decade of work has been presented, see Speaking; for grants, awards, and the scholarly reception of the faculty research it grew from, see Recognition and Scholarly Reception. The projects above draw on the open-access Human-Centered AI collections at Digital Kenyon.