
Nothing Breaks Like A.I. Heart

An essay about artificial intelligence, emotional intelligence, and finding an ending

Omar* and I matched 13 days before I left England. From that point on, we spent maybe 12 hours apart. The rest was getting lost in the library stacks, walking hand in hand across the Bridge of Sighs, sitting in the grass on the lawn of King’s College, laughing our arses off at the Footlights Pantomime, fumbling our way through the college bar crawl, cycling to Grantchester for tea and scones. We were filling in the outline of a love story I’d written in my head reading novels and Instagram posts by American women studying abroad. It was the perfect beginning, but it happened at the end of the year.

I needed to leave for a new job in San Francisco. He needed to stay to finish his PhD. I begged the airline to let me change my flight. He sprinted to the train station to say goodbye.

Text that looks like this has been generated by the model, and text marked with a toggle can be swapped out by clicking.

I always assumed you don’t realize when you’re living in a cliche. But, in actuality, when you’re on one end of the romantic comedy trope, waiting at the station for a person who looks like they could have appeared in an Ezra Pound poem, could have appeared in a book about Hemingway’s racing fever, could have been at the Chelsea Hotel when Sylvia Plath lived there, you recognize and fear the inevitable final scene. Nothing can actually stay trite forever. In real life the ending is never so neat.

To navigate this story, think of each circle as a chance to choose your own adventure. Pick an option below to change the narrative path.

He said he’d move to be with me.

I never should’ve fallen for a broke grad student. But I did. And I couldn’t have imagined the nightmare of moving to a new country for him and then getting dumped. A year and a half later, I’m starting to feel like myself again.

You’ve chosen a path where the story ends. Select another path to continue.

He said he’d move to be with me. I got on the plane and ended things. And then, because I am a millennial or because I am me, I tried to change the ending. He wouldn’t take me back, but we agreed to try being friends.

I moved in. I started my new job. We got engaged. We sold our flat in London. He moved to California. I moved in. We got married. And now, we’re writing a book.

You’ve chosen a path where the story ends. Select another path to continue.

I am not a writer; I am a machine learning engineer. In my work, the question of whether something has ended or not is steeped in data and trendlines: Is the time series stationary? Is the shock transitory or permanent? Answering these questions requires poring over every piece of a dataset, extracting features that might mean something, identifying each possible inflection point. When I try to apply this energy to my love life (or lack thereof), friends tell me I’m “dwelling too much on the past” or “living in a fantasy world” or “not really their friend.”

So, when two years had passed and my feelings for Omar still weren’t resolved, I didn’t tell my friends. I told a supercomputer.


GPT-3 — the third “generative pre-trained transformer” released by the start-up (and my employer) OpenAI — is an example of a language model, or a tool that predicts what sequence of words should follow a user-provided prompt. Given a prompt like “Hello my name,” the model will, more often than not, suggest that the next word is “is.”
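For readers who want to see what that looks like in practice, here is a minimal sketch of a next-word request, using OpenAI’s legacy Completions API as it existed around the time this essay was written. The engine name and parameter values are illustrative assumptions, not the settings used for the essay.

```python
# A minimal sketch (not the essay's actual code) of asking GPT-3 for the
# next word, using OpenAI's legacy Completions API. The engine name and
# parameter values here are illustrative assumptions.
import openai

openai.api_key = "YOUR_API_KEY"  # access is gated and granted per use case

response = openai.Completion.create(
    engine="davinci",        # the base GPT-3 model
    prompt="Hello my name",  # the user-provided prompt
    max_tokens=1,            # ask for only the next word
    temperature=0.0,         # take the single most likely continuation
)

print(response["choices"][0]["text"])  # more often than not: " is"
```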

It happens to be the biggest publicly released model of its kind: 175 billion parameters (although access is gated through an API with strict use case guidelines). You can think of a parameter roughly as a synapse; the human brain has around 100 trillion of those, but it has to focus on lots of things besides language, like swiping on dating apps and moving to San Francisco. To learn all of those parameters, GPT-3 is trained on hundreds of billions of sentences and stories from the Internet and books. Written something on Reddit? There’s a good chance GPT-3 has read it.

The model learns the toxicity of these forums. The generations shown in this essay are cherry-picked to demonstrate particular aspects of the model and what it taught me about my relationship and GPT-3. They're also chosen to minimize the reader's exposure to harmful and toxic content. Encoded biases in language models perpetuate real harm when these models are released into the real world. To learn more about these and other dangers of big language models, I recommend Bender, Gebru et al. and the original GPT-3 paper. Access to the model is governed by strict usage guidelines and a content filter is available, but there is significant work to do.

Because it has read from so many different sources with so many different authors, and also because it is a computer, the model lacks self-awareness. So, given a prompt like “Hi, I’m Pamela and Omar doesn’t love me,” GPT-3 will respond as me, Pamela, and write the story of a relationship with all the poeticism and pathos and, yes, melodrama, that any young woman who has been dumped could ever want.

And, that’s just what I asked for.


What if his bike hadn’t been stolen after our second date?

If Omar’s bike hadn’t been stolen, it would have ended a lot sooner. He wouldn’t have needed to run to the train station.

What if he hadn’t just gotten out of a seven year relationship?

I might have fallen in love for good.

What if I’d flown back to England on a unicorn made of marshmallows?

I would have had some really good Instagram content.

You’ve chosen a path where the story ends. Select another path to continue.

By the time I got access to the model, it was late July, 2020. In the fifth month of quarantine, having recently moved home to face my teenage journals, I wasn’t sure if I missed talking to strangers or to Omar. But I wanted to know if, with enough prodding, I could turn GPT-3 into either, or at least convince myself that I had.

I put my Notes app drafts into the model hoping to fill in my trailing thoughts. I wrote different permutations thinking I could identify what went wrong. I asked for custom pop songs and sea shanties in the style of Tosca and Taylor Swift about my specific woe. It was as much escapism as it was regaining control over a model that can seem incomprehensible and a story that I no longer had control over.

I heard you were a computer scientist and so I said to myself I’d better get to know her (get to know her) I’d better get to know her (get to know her)

You’ve chosen a path where the story ends. Select another path to continue.


Others experiment with using AI to answer philosophical questions or probe the psyche. They say this reveals something about how humans learn and the nature of intelligence.

One argument against the model’s intelligence is that it has a tendency to “overfit,” or get lost in endless loops of repetitive language. This happens when the model essentially memorizes its training data and assumes that everything else should look like it.

You meet someone, you fall in love, you break up, and, heartbroken, you assume you’ll never find anyone who compares. You look for a perfect substitute.

While models have gotten better at avoiding these traps as they’ve gotten larger, there are still sequences of words and language that trigger this spiraling behavior.
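As a toy illustration (this is not GPT-3, just ordinary temperature-scaled sampling over a made-up vocabulary), you can see how sharpening the next-word distribution makes the same word win over and over, which is the shape of the loops that follow.

```python
# Toy demo of why low sampling temperature encourages loops: dividing the
# scores by a small temperature sharpens the distribution, so the single
# most likely next word is chosen almost every time. All numbers are made up.
import numpy as np

rng = np.random.default_rng(0)
vocab = ["text", "call", "cry", "apologize"]
logits = np.array([2.0, 1.0, 0.5, 0.2])  # invented scores for the next word

def sample(temperature):
    scaled = logits / temperature
    probs = np.exp(scaled - scaled.max())
    probs /= probs.sum()
    return vocab[rng.choice(len(vocab), p=probs)]

for t in (0.1, 0.9):
    words = [sample(t) for _ in range(8)]
    print(f"temperature={t}: I would " + ". I would ".join(words) + ".")
# At 0.1 nearly every draw is "text" -- the loop. At 0.9 the others appear.
```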

I would text.

I would text. He would text. I would call. He would call. I would cry. He would cry. I would say I was sorry. He would say he was sorry.

Omar and I were stuck in a loop. I would text, he wouldn’t respond for a few weeks, and then would call one day to check in.

I would text again. He would ignore again. I would text again. He would ignore again. I would text again. He would ignore again. I would text

I tell GPT-3 this and get back: He says he’s sorry. He says he’s not sorry. He says he’s sorry. He says he’s not sorry. He says he’s… He called. I didn’t answer. I called. He didn’t answer. I texted. He didn’t respond. I called. He didn’t answer. I texted. He didn’t respond. I called. He didn’t answer. I cry when I think about how much I wanted to be with him. I was so lonely. I was so sad. I was so desperate. I was so stupid. I was so stupid. I was so stupid. I was so stupid.

GPT-3’s loops feel reminiscent of picking petals on the schoolyard: "He loves me. He loves me not. He loves me. He loves me not." Loops can be exhausting.

Occasionally one of us would say something – “you look nice,” “he doesn’t sound like your type,” “her?” – that crossed whatever friendship boundary lay between us, a shift in tone somehow more relevant than the 5,000 miles. I would text, he wouldn’t respond for a few weeks, and then would call one day to tell me about someone new he’d met.

I learn that in the original French version of the schoolyard petal rhyme you don’t just loop through "He loves me. He loves me not." but question how far the other person’s affection goes: un peu, beaucoup, passionnément, à la folie, pas du tout.

He loves me a little. He loves me a lot. He loves me passionately. He loves me to madness. He loves me not at all.

I would text, he wouldn’t respond for a few weeks, and then would call one day to tell me about his recent break-up.


When you prompt GPT-3, you request a certain number of words back. A language model can only handle a limited number of words across the prompt and the generation combined. This means that it may end your story mid-sentence or mid-thought. Or, it may reach an ending and then keep generating because it’s been instructed to talk for a certain amount of time about the characters with which it’s been prompted. If there are two characters in its universe, you are forced to engage with them, even after they and you should have moved on.
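In concrete terms, the budget looks roughly like the sketch below, assuming the original GPT-3’s 2,048-token context window and approximating the count with the GPT-2 byte-pair tokenizer (which GPT-3’s tokenizer was based on); the prompt is just a placeholder.

```python
# Rough sketch of the prompt-plus-generation budget. The original GPT-3 had
# a 2,048-token context window; its tokenizer was based on GPT-2's, so we
# approximate the count with the GPT-2 tokenizer from Hugging Face.
from transformers import GPT2TokenizerFast

CONTEXT_WINDOW = 2048  # tokens, shared between the prompt and the generation

tokenizer = GPT2TokenizerFast.from_pretrained("gpt2")

prompt = "Hi, I'm Pamela and Omar doesn't love me. " * 40  # placeholder prompt
prompt_tokens = len(tokenizer.encode(prompt))

# Whatever is left is the most the model can generate before it gets cut
# off -- possibly mid-sentence, possibly long after the story should have ended.
max_generation = CONTEXT_WINDOW - prompt_tokens
print(f"{prompt_tokens} prompt tokens leave room for {max_generation} more")
```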

Despite our awkwardness, it was casual enough that I was not surprised when, a year after I’d left England, he asked if I needed a place to stay on a layover in London. There was anticipation but not expectation when we discussed how his roommates would feel about someone sleeping on the couch.

Around a week before I was due to arrive he...

I never thought I’d say this but I was pretty sure I was not ready for that. I said I’d think about it. And then, a week later, I was in the underground, staring at his name on my phone, telling him I’d like to meet for a drink.

I spent the day in London in a state of disbelief. We both knew there was a line neither of us were going to cross but the possibility of a friendship was real.

Around a week before I was due to arrive he stopped replying to me. I stared at my phone over four days in Poland. Paid for premium data in Lithuania just in case the replies weren’t coming through. Arrived in London at 5am and schlepped my bag to Cambridge, then back to London and through two museums before my sister told me she’d booked a hotel. I was angry but mostly just embarrassed.

He texted back, “Sup?”

It was a relief to know a breakup was in the cards. There was no need to write a speech or hide my feelings. It was clear he didn’t want to be with me and I didn’t want to be with him. It was also clear that he still wanted to be my friend.

I wore my casual cool girl outfit and carried my favorite dress in case we changed plans. We walked and talked for eight hours, past the Bank of England, past Combe’s Music Hall, past all the Important Historical Places, through and on to a row of houses across from Westminster Bridge, through the square where the weirdos live and a coffee shop that is starting to feel a lot like a Starbucks, past the door that was featured in a scene in Pride and Prejudice, then down a street where Mary Shelley wrote Frankenstein, through the City of London and all those tall buildings with the funny names: The Gherkin, The Walkie-Talkie, The Cheesegrater, The Shard, The Spire, The Heron Tower, The Monument, The Axe, The Lloyd's Building, The Leadenhall Building, and then we made out and then he said, “I finally feel comfortable being your friend.”

My friends say to Venmo-request him for the price of the hotel room and the dress that blew away. My friends say to delete his number and texts and Facebook messages and not to forget to disconnect on LinkedIn lest I "accidentally" congratulate him on a promotion years down the line. My friends say there’s no intelligence behind my desire to stay friends with him.


GPT-3 doesn’t care about my friends. It doesn’t care that I work at a start-up, live in a city, that I am quarantined in a house with two other people. It doesn’t care that Omar and I didn’t have the language to say what we wanted from each other, that we fought about his insecurity and my loneliness, that I felt like I was losing myself. It doesn’t care which of my sentences are tired or stale or cliche.

After all, in a sense everything GPT-3 generates is cliche because it’s all rooted in everything that has been written before. In my experience, the model can struggle to land at something that feels like the truth when a story is more nuanced than boy meets girl, boy and girl break up, girl is devastated.

As I write, I notice peculiarities in my sentences: they are circular, they get distracted, they repeat similar phrases. I’m unsure if GPT-3 is picking up on my style or if my writing is adopting the quirks of the model. I'm unsure if GPT-3 is really leading me down its tunneled tropes or if I've also been trained on the same inputs. I’m unsure if GPT-3 is right when it says it’s ended. After all, it’s not really up to me, or the model.

He called one day in the week preceding quarantine and I joked we should meet in New York. He smiled and said, “Actually, that would be pretty fun, I’ll look at flights, let’s talk soon.”

The next day New York shut down.

We chatted: about the weather and the government and the wealth gap, about YouTube stars and the transgression of politics and the ongoing demise of the human mind, about Corbyn and phobias and love and hate.

A month passed. We laughed about the Club Penguin date I’d gone on and his debate about an ex who’d returned to London.

More months passed. He called.

I answer the phone as…

Omar: “It’s funny, she’s in San Francisco and basically has your dream job.”

Me, as my friends: Never contact me again.

You’ve chosen a path where the story ends. Select another path to continue.

Omar: “It’s funny, she’s in San Francisco and basically has your dream job.”

Me: “Niiiice!”

Omar: “She visited last week and she’s moving to London in January.”

Me: “So fun!”

Omar: “Happy to stay friends but know it’s probably weird for you.”

Me: “Not at all."

Omar: “It’s funny, she’s in San Francisco and basically has your dream job.”

Me, as my teenage self: People who have dreams usually care about stuff that actually happens in the real world.

Omar: “She visited last week and she’s moving to London in January.”

Me, as my teenage self: That’s a long way to go just to avoid seeing someone you hate.

You’ve chosen a path where the story ends. Select another path to continue.


GPT-3’s outputs say I’m upset that he thought my dream job was to manage make-up artists. That I’m upset that he’d replaced me with someone who lived a few blocks away. That I’m upset that he probably told her he loved her at the train station.

I was mostly upset that I didn’t know if that was his way of ending it once and for all, or if it had really ended months or years prior, or if he was going to call me again two days or six months down the line just to check in.

Contrary to what the model says, I wasn’t devastated that the fairytale didn’t come true.

The reality is more complicated: we were never really dating so we never really broke up, but we were never really friends so I didn't get to be mad when he ghosted. When a relationship falls apart slowly, it’s hard to keep track of all the variables — the unfinished pack of chocolate stars, the awkward kiss that makes me cringe to this day, the train we missed by a few minutes — any of them might have been the end.

The reality is less romantic: a man tells a woman he doesn’t want to be with her after spending years looking for his “soul mate,” and then meets someone soon afterwards. She doesn’t tell her friends for a few months, gets her heart broken quietly into pieces while believing she is slowly piecing it back together, then tells everyone at once that it doesn’t matter anymore.

On occasion the model’s predicted outcomes veered close enough to my truth that there was solace in reading them from a distance. But, ultimately, they still weren’t true.

I decided if Omar refused to write an ending and I didn’t agree with how GPT-3 filled it in, then I would write my own. Even if I needed some help along the way.


Methodology

We used either the base 175B parameter model or a version fine-tuned on “instruction following” for all GPT-3 generations.

Prompts took two main forms. In some, we did an open-ended generation: we’d prompt the model with a few paragraphs of text and let it do the rest. This is how the “wheel generations” work. In many cases this was done with the “story text” that appears above that particular wheel as you read it, though we have made some line edits since.

In other cases, mostly where you see rotating “mad-libs” options of text, we would prompt the model with a “headline” like “Lists of classic Cambridge activities,” followed by samples that fit the flow of the text and were examples of what we’d want. This is called few-shot learning: we give the model a few examples of what we’re looking for and let GPT-3 do the rest.
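Here is a sketch of what that few-shot setup might look like. The headline comes from the methodology above, the example lines are lifted from the essay’s opening, and the call itself uses the legacy Completions API with illustrative parameters rather than the exact ones used for the piece. The open-ended “wheel” prompts work the same way, just with story paragraphs in place of the list.

```python
# Sketch of a few-shot "mad-libs" prompt: a headline, a few examples of the
# kind of text we want, and a blank for the model to continue. The examples
# are drawn from the essay; the API parameters here are illustrative.
import openai

prompt = """Lists of classic Cambridge activities

- getting lost in the library stacks
- cycling to Grantchester for tea and scones
- fumbling our way through the college bar crawl
-"""

response = openai.Completion.create(
    engine="davinci",
    prompt=prompt,
    max_tokens=20,
    temperature=0.8,  # within the 0.7-0.9 range used for most generations
    stop=["\n"],      # stop at the end of the generated list item
)

print(response["choices"][0]["text"])
```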

We tried our best not to shoehorn responses. For example, I wanted to place a reference to Estelle’s “American Boy” in one spot but couldn’t get the model to cooperate. Other times cooperation felt like cheating, e.g. putting You’ve Got Mail in the prompt gets Sleepless in Seattle in return.

For the overfitting section we deliberately lowered the temperature to induce looping behavior from the model. Other generations were done with a temperature between 0.7 and 0.9.
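In code, that difference is just the temperature argument on the same kind of call. The exact low value isn’t stated above, so the 0.2 below is an assumption, and the prompt is a placeholder.

```python
# Illustrative only: the two temperature regimes described above, on the
# legacy Completions API. The low value (0.2) is an assumption; the essay
# only says the temperature was deliberately lowered to induce looping.
import openai

prompt = "I would text. He would"

looping = openai.Completion.create(
    engine="davinci", prompt=prompt, max_tokens=60,
    temperature=0.2,  # low temperature to induce repetitive loops
)
typical = openai.Completion.create(
    engine="davinci", prompt=prompt, max_tokens=60,
    temperature=0.8,  # other generations used 0.7 to 0.9
)
```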

*We chose the name Omar after some research to establish that it was similar enough to the subject’s real name to make the model outputs reasonable substitutes, and that the details provided about him could not be used to de-anonymize him.

Author’s Note

It’s easy to say I should cite every place the model is used, but doing so wouldn’t fully cover the ways the model influenced the shape and sound of how I wrote this piece.

Whether or not you are copying precise outputs, it’s easy for a story to take on the shape of the model. When generations don’t look how I’d expect I start to edit out details and nuance in the prompt until the model realizes, “Oh, he had to run to the train station because his bike was stolen.” My writing starts to sound like GPT-3: I repeat similar phrases, lose focus in the middle of sentences, and fall back on cliched tropes that feel close enough to the truth. Over time and drafts, the inputs and outputs converge.

I say this precisely to get off the hook for what is written here. The process is fun ("Rewrite this story as Tosca and Taylor Swift and Tolstoy"), until it’s time to hit publish and Omar tells me he can’t wait to learn the truth from what I write and GPT-3 tells me my writing sounds like a young woman or a teen magazine (which are labels I should not scoff at, but do, but hopefully only because of what I know of the model’s internalized misogyny).

Writing with GPT-3, it is hard not to drown in the power of suggestion, “Oh that is a really good framing,” and lose sight of the truth, “No, I really wanted to just be friends!” Even if you feel that truth deeply. It’s similar to writing with an editor, albeit one far less compassionate and patient than Jan is. It’s also similar to putting anything up on a tech platform, and letting that platform refract how you see yourself or your content.

Of course that's also part of being in any friendship or relationship. What it is and what the other person thinks it is and what the other person's friends think you think it is, they're all, at best, co-determined. Somewhere below that is being stranded in London.

And so, I write an Author’s Note, all by myself, just to clarify.