gpTea

2024

Course: 4.044 Interaction Intelligence

Instructors(s): Marcelo Coehlo, JB LaBrune

Collaborator(s): Kevin Tang

Introduction

“gpTea” is a tea set that preserves and encourages the sharing of untold stories between separated friends and loved ones through generative AI. It enhances the ritual of tea drinking by turning each cup into a shared, interactive storytelling experience, encouraging a slower, more reflective form of communication.

It was created for Interaction Intelligence, a design studio taught by Marcelo Coehlo, as an exploration of Large Language Objects (LLOs).

Motivation

We wanted to explore asynchronous and "slow" forms of communication as a mediation for the various degrees of "separation" which make communication and connection difficult between loved ones. The use case which we initially focused on was intergenerational story-telling between grandchildren and grandparents which we felt embodied the most degrees of separation (distance, time, age, culture, language).

Another idea we wanted to explore was the object's potential for storing the collective memories of multiple users and using them to facilitate intimacy among them.

Personally, my experience as a member of the Asian America diaspora heavily motivated the concept for this project. Another student from our class remarked that it would be useful for communicating with his grandfather in China who has no pictures from his childhood to show as gpTea could potentially generate a photorealistic image in this case. We also realized that chatGpt, the LLM which gpTea interfaces, could automatically communicate with the user based on the language which they begin conversing with.

Image 1

Inspiration

We were heavily inspired by previous works in the class such as Kai Zhang's AIncense which blended technology with ritualism as well as Yubo Zhao and Xiying Bao's Narratron which harnessed the generative capabilities of LLMs for story-telling.

Aesthetically, we drew inspiration from the functional shape and coarse texture of traditional earthenware. We opted not to sand the object all the way to emulate the texture of stone.

Image 1
Image 2

We were also drawn to space objects such as the Monolith from 2001 - A Space Odyssey as well as the pod from Arrival which featured sleek, black objects with simple but imposing and somewhat foreboding silhouettes. Overall, we composed the technological and traditional design elements with hopes of forming a "futuristic artifact."

Image 1
Image 2

Finally, we were also drawn to the aesthetics of light projected onto fluids, an experience which resonates with many. Initially, we did not know why this was so captivating but upon further reflection, were able to understand why (which I detail in Design Reflections).

Implementation

Hardware

gpTea’s hardware consists of three parts, the cup, the tray, and the teapot. The cup features a 1.28 inch round display, ESP32 microcontroller, microphone, and 1 liPo battery. The body of the cup has a magnifying lens built into its geometry to blend the display into the liquid content once the cup is filled. The tray houses a speaker, a stepper motor that manipulates the teapot’s rotation via a timing belt, and a rotary encoder equipped with 2 haptic motors for user feedback and one hall effect sensor for detecting the presence of the cup. All the electronics housed in the tray are connected to an arduino nano microcontroller.

Feel free to reach out to my project partner, Kevin Tang, who executed all the hardware + fabrication!

Software

gpTea interfaces with a full-stack application consisting of a React frontend and Express server through both wireless and serial port connection. The application is connected to a MongoDB database to store and retrieve stories. Additionally, the application is configured with API endpoints which the teacup continuously and wirelessly fetches images and effects from.

Prompt Engineering

Image 1
Image 2

Design Choices & Reflections

Adopting the teapot's original design and interactions

We chose to adopt the original aesthetic form and interactions of the teapot for several reasons. Firstly, to leverage an existing, age-old ritual that transcends culture, age, and language. Secondly, to borrow the slow and reflective "time-qualities" of tea-drinking.

We mapped interactions between the user and AI through the original affordances of the teapot – just as the teapot pours tea into a teacup, gpTea augments that familiar and beloved moment by telling, or “pouring,” a story which comes to life at the bottom of the cup. With the cup of tea in hand, the user then drinks the tea just as the told story becomes a part of them. Pouring and drinking become metaphorically linked with listening and absorbing/consuming through the mapping of these interactions.

Designing for poetry over productivity

Though gpTea’s ability to speak and pour tea on its own make it feel “intelligent”, it is the interplay of tea and light which imbues it with poetic qualities. We could have easily displayed the generated the image to the user on an LED screen, but with the generated image appearing at the bottom of the teacup, the tea becomes a blurred boundary between the fictitious, historical, surreal and real. The tea functions then as the “transitional space” between internal fantasy and external reality, an area of psychological experience that engages the imagination.

The visual distortions and discoloring from the movement and hue of the tea create a “digital sfumato” that paints the imagined memory with uncertainty. Coincidentally, the image generated by DALL-E is also nebulous, abstract, and sometimes even eerie. These ambiguities result in an ambience. The imprecision of the display encourages users to fill in gaps in information and to ultimately think more deeply about the stories being told and about the people to whom the stories belong. The projection of light onto tea offers several effects: the rhythmic rippling and swirling motion of the tea creates an entrancing effect; the soft glow of light permeating through the murky tea is ethereal; the fleeting image provides an air of transience.

Image 1
Image 2
Image 3

As the teapot pours autonomously, prompted by the user’s action to activate storytelling mode, the affordance of flowing tea, the rippling and reflective surface of the liquid, and the natural color of the tea saturates the displayed image, creating a natural patina.

Limiting touch points

Users interact primarily through the tea cup as the speaker, microphone, dial and display.

Like tea drinking, communicating through gpTea becomes dialogic -- one step must be executed before proceeding to the next, a continuous exchange occurs between user and object. Limiting touchpoints paces the user’s interaction with the object, creating a “slow” quality that invites reflection. We designed gpTea to be “slow” (as opposed to "fast technology" which demands performance efficiency over a limited time for a specific use-case) so that users can fully become intimate with one another by stopping to reflect at each moment and using the object over a long period of time. The ritualistic nature of the narrative delivery not only slows time down, but also closes space down into a cup – the main touch point.

Image 1
Image 2
Image 3
Image 4

The visual effects projected onto the cup, such as a blinking red light for recording, allow users to access the state of the LLO

Storing memories

By saving the user's stories, the object becomes highly personalized as it holds the user’s “real context” and mediates others’ access to it. In doing so, gpTea becomes a vessel for memories which others can acces long after the user has passed.

To address concerns regarding privacy, we opted not to store the transcript of the user's conversation with the AI but rather have an LLM extract and summarize the user's story from the conversation before storing it. Through initial feedback from others, we realized that in doing so, we also provide the user an opportunity to leave a digital trace for the next user and introduce a tiny bit of discord into the system. The idea of a digital trace is fascinating as digital technologies are often programmed to refresh + reboot. It also sustains a thread of continuity between the ongoing conversation between the users through the LLO -- one user uses the LLO and leaves a digital trace that will influence the other user's experience and so on. It adds a delightful poetry to digital information, metaphorically akin to the residual tea leaves.

Chaining LLM prompts

In designing gpTea, we opted to “chain” several LLM prompts together.

We realized several benefits to this approach beyond improving the quality of generated outputs. For example, it naturally allows us to map each prompt with an interaction such that the user can focus on the experience of each interaction at hand and contributes to the “slowness” of the machine. The chaining of LLM prompts stretches time between each inputs-outputs exchange, allowing meaningful interaction to occur there. It also introduces the context from the user into the interaction experience, allowing users to “switch” or “exit” prompts. In gpTea, users are able to switch between a passive storytelling mode and an active storytelling mode depending on their desired interaction.

Contributions

Kevin Tang and Kelly Fang both contributed to the concept and design of the object. Kevin Tang contributed to the hardware and fabrication. Kelly Fang contributed to the software development and prompt engineering. Marcelo Coehlo provided design guidance throughout the process. Jean-Baptiste LaBrune helped us reflect on the final design and its significance in the broader context of HCI research.