UC Berkeley: “This is huge for patients who have damage to their speech mechanisms.”
Neuroscientists may one day be able to hear the imagined speech of a patient unable to speak due to stroke or paralysis, according to University of California, Berkeley, researchers.
These scientists have succeeded in decoding electrical activity in the brain’s temporal lobe — the seat of the auditory system — as a person listens to normal conversation. Based on this correlation between sound and brain activity, they then were able to predict the words the person had heard solely from the temporal lobe activity.
“This is huge for patients who have damage to their speech mechanisms because of a stroke or Lou Gehrig’s disease and can’t speak,” said co-author Robert Knight, a UC Berkeley professor of psychology and neuroscience. “If you could eventually reconstruct imagined conversations from brain activity, thousands of people could benefit.”
“This research is based on sounds a person actually hears, but to use it for reconstructing imagined conversations, these principles would have to apply to someone’s internal verbalizations,” cautioned first author Brian N. Pasley, a postdoctoral researcher in the center. “There is some evidence that hearing the sound and imagining the sound activate similar areas of the brain. If you can understand the relationship well enough between the brain recordings and sound, you could either synthesize the actual sound a person is thinking, or just write out the words with a type of interface device.”
In addition to the potential for expanding the communication ability of the severely disabled, he noted, the research also “is telling us a lot about how the brain in normal people represents and processes speech sounds.”
Pasley and his colleagues at UC Berkeley, UC San Francisco, University of Maryland and The Johns Hopkins University report their findings today (Jan. 31) in the open-access journal PLoS Biology.
They enlisted the help of people undergoing brain surgery to determine the location of intractable seizures so that the area can be removed in a second surgery. Neurosurgeons typically cut a hole in the skull and safely place electrodes on the brain surface or cortex — in this case, up to 256 electrodes covering the temporal lobe — to record activity over a period of a week to pinpoint the seizures. For this study, 15 neurosurgical patients volunteered to participate.
Pasley visited each person in the hospital to record the brain activity detected by the electrodes as they heard five to 10 minutes of conversation. Pasley used this data to reconstruct and play back the sounds the patients heard. He was able to do this because there is evidence that the brain breaks down sound into its component acoustic frequencies — for example, between a low of about 1 Hertz (cycles per second) to a high of about 8,000 Hertz — that are important for speech sounds.
Pasley tested two different computational models to match spoken sounds to the pattern of activity in the electrodes. The patients then heard a single word, and Pasley used the models to predict the word based on electrode recordings.
“We are looking at which cortical sites are increasing activity at particular acoustic frequencies, and from that, we map back to the sound,” Pasley said. He compared the technique to a pianist who knows the sounds of the keys so well that she can look at the keys another pianist is playing in a soundproof room and “hear” the music, much as Ludwig van Beethoven was able to “hear” his compositions despite being deaf.
The better of the two methods was able to reproduce a sound close enough to the original word for Pasley and his fellow researchers to correctly guess the word.
“We think we would be more accurate with an hour of listening and recording and then repeating the word many times,” Pasley said. But because any realistic device would need to accurately identify words heard the first time, he decided to test the models using only a single trial.
“This research is a major step toward understanding what features of speech are represented in the human brain” Knight said. “Brian’s analysis can reproduce the sound the patient heard, and you can actually recognize the word, although not at a perfect level.”