In Search of Lost Discourse



In this sketch I was concerned with how a small device with limited physical memory could possibly remember something as expansive as Proust's "In Search Of Lost Time". I looked into a number of natural language processing algorithms, all of which depended on huge internal data sets and intense computational power. In my research I came across the ideas of discourse analysis as defined by Michel Foucault and Jürgen Habermas and specifically the idea that phrases of words are in constant competition. I decided that one way to remember Proust would be to remember a meaningful phrase from the book, something that stood out from the rest.
I designed a fitness algorithm that tests a phrase for the average length of words, the longest and shortest word (and the distance between these values and the average), as well as the frequencies of each letter. Each incoming phrases is tested against the currently stored phrase and if it is found to be more fit, it becomes the phrase stored in the device's EEPROM memory. The device will load this phrase from EEPROM when it is turned on, so it can pick up the story where it left off. In this way, the hardware is not only remembring Proust as it reads, but it is remembering the novel as long as it remains functional. As time passes however, the remembered phrases looses some of its luster and decays in fitness, allowing it to be replaced with a newer one.
This sketch was simulated in C++ and tested using the WinAVR compiler to make sure it fit the memory requirements of the Atmega32. After seeing the simulation, I decided that there were huge flaws in sketch and abandoned it. First of all, this was a project that had no need for an audience or any form of human interaction. It was simply an algorithm processing a body a text. This project did, however, became the seed for the text-based sampler. My plan is to allow the user to determine what is memorable within a huge stream of text, and give them the tools to generate a narrative out of these pieces. Allowing human control over the fitness algorithm will completely change the nature of the work, making it something that grows by human and audience participation.



