09/22/23 | Fruit flies offer clues to how brains make reward-based decisions

Like many collectors of L.P. records, James Fitzgerald’s brother-in-law has a favorite store where he consistently finds the best vinyl for his collection. But there are times when he spends hours at the store and comes up empty. He also knows that occasionally he should venture to the record store on the other side of town, where he sometimes scores a hard-to-find gem that was stocked since his last visit.

Fitzgerald’s brother-in-law is making a calculation: weighing probable outcomes to guide his behavior. His favorite record store rewards him more frequently, so he visits that store the most. The second-tier store is less likely to reward him, so he visits that store only occasionally.

Glenn Turner, who like Fitzgerald is a neuroscientist and group leader at HHMI’s Janelia Research Campus, says this “record foraging” habit is a perfect example of a type of behavior called matching that is pervasive in the animal kingdom. Instead of vinyl, non-hipster animals like mice and flies forage for food, using sensory cues like odors to evaluate food quality from a distance.

But while matching has been observed in animals ranging from pigeons to mice to humans, it has been unclear how the brain carries out this value-based decision-making. Researchers had previously proposed a theory for how that might happen, but the idea hadn’t been tested in the real world.

Now, a team of Janelia researchers that includes Fitzgerald, Turner, Janelia Graduate Scholar Adithya Rajagopalan, former Janelia Fellow Ran Darshan and Research Specialist Karen Hibbard has confirmed that the proposed theory works. Rajagopalan’s experiments showed that, like Fitzgerald’s brother-in-law, fruit flies can make decisions based on their expectations about the likelihood of a reward. The team also pinpointed the site in the fly brain where these value adjustments are made, enabling them to directly test this theory at the level of neural circuits.

“We found that flies are using expectation to assign value to their world,” Turner says. “It also really nicely connects back to this theoretical work that was so elegant and explains this widespread phenomenon.”

Uncovering how the fly brain carries out this ubiquitous behavior could help scientists better understand how similar decision-making happens in the brains of larger animals, including humans. Decision-making goes awry in diseases like addiction, so understanding how this process works in simpler brains has broad value, according to the researchers.

“The kinds of ideas and the theoretical framework that we have identified in this paper feel like a seed for evolution to build on in larger organisms, where more layers are added to allow for more complex behaviors,” says Rajagopalan, the first author of a new paper describing the work.

Videos of the fly in the Y-arena, explained in the schematic above. In the first video, the fly begins in one arm of the Y filled with clean odor-free air and chooses between two other arms that have been randomly assigned different odors, denoted by the different colors. The reward is provided on the third trial, when the fly chooses the orange odor. In the second video, the odor that the fly is moving in is denoted by the names on the bottom left. When the fly is in the clean air, both odors are grayed out. When the fly is in an arm that contains 3-octanol, OCT is lit up in green, and when the fly is in an arm that contains 4-methylcyclohexanol, MCH is lit up in yellow. Flies in this case are rewarded after choosing OCT. Credit: Rajagopalan et al.

Investigating matching behavior

Fruit flies, whose brains have been well studied and mapped, were an appealing choice for examining matching and its underlying mechanisms. But first, the team had to design a way to observe fruit fly decisions.

Rajagopalan, who came to the Turner Lab through a joint graduate program with Johns Hopkins University, spearheaded the project. He designed an experiment where a single fly enters one arm of a symmetrical Y-shaped arena. Odors are pumped into the other two arms of the Y. The fly chooses to follow one odor or the other and is rewarded – in this case by having its sugar-sensing neurons activated – but with different probabilities: One odor might translate into a reward 80 percent of the time, while the other odor might yield a reward 20 percent of the time.

The researchers found that the fly learned to expect the rewards in the same proportions in which they were presented and then made its choices based on those expectations. These actions give the matching behavior its name: 80 percent of the time, the fly chose the odor that gives 80 percent of the rewards. And 20 percent of the time, it chose the odor that yields 20 percent of the rewards.
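
For readers who want to see the arithmetic behind matching, here is a minimal Python sketch. It compares the share of choices an odor receives with the share of rewards that odor delivers. The function name `matching_summary` and the ten-trial session are hypothetical and made up for illustration; they are not data from the study.

```python
from collections import Counter

def matching_summary(trials):
    """Return {odor: (choice_fraction, reward_fraction)}.

    `trials` is a list of (odor, rewarded) pairs, one per choice.
    Matching means the two fractions are roughly equal for each odor.
    """
    choices = Counter(odor for odor, _ in trials)
    rewards = Counter(odor for odor, rewarded in trials if rewarded)
    n_choices = sum(choices.values())
    n_rewards = sum(rewards.values())
    summary = {}
    for odor in choices:
        choice_fraction = choices[odor] / n_choices
        reward_fraction = rewards[odor] / n_rewards if n_rewards else 0.0
        summary[odor] = (choice_fraction, reward_fraction)
    return summary

# Hypothetical session (illustrative only): 8 choices of OCT, 2 of MCH.
# OCT delivers 4 of the 5 rewards earned, MCH delivers 1 of the 5.
trials = (
    [("OCT", True)] * 4 + [("OCT", False)] * 4
    + [("MCH", True)] + [("MCH", False)]
)
summary = matching_summary(trials)
for odor, (choice_fraction, reward_fraction) in summary.items():
    print(odor, choice_fraction, reward_fraction)
```

In this made-up session the fly picks OCT on 8 of 10 trials and OCT supplies 4 of the 5 rewards, so both fractions come out to 0.8. That equality is what the term matching refers to.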

The team tracked the behavior to specific synapses in the mushroom body, a region of the fly brain responsible for learning and memory. This enabled them to create a model of how the brain carries out this behavior, based on the theory of matching. In this theory, the values associated with different choices are learned through changes in synaptic strength: Synaptic connections are strengthened or weakened in proportion to the difference between expected and received reward. The team’s model based on this theory and the fly’s behavior allowed them to demonstrate how individual synapses are changing to enable value-based decision-making.
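
The learning rule described above can be illustrated with a short Python sketch. It is a toy under stated assumptions, not the team’s actual model: each odor carries a single number standing in for its expected reward, and that number moves in proportion to the difference between the received and the expected reward. The names `update_expectation` and `simulate`, the learning rate of 0.1, and the choice to sample the two odors evenly are all assumptions made for illustration.

```python
import random

def update_expectation(expected, received, learning_rate=0.1):
    """Shift the expectation in proportion to (received - expected)."""
    return expected + learning_rate * (received - expected)

def simulate(reward_prob, n_trials=500, learning_rate=0.1, seed=0):
    """Track a reward expectation for each odor over repeated trials.

    Toy assumption: the odor is picked at random on every trial, so
    this sketch shows only how expectations are learned, not how the
    fly turns them into choices.
    """
    rng = random.Random(seed)
    odors = list(reward_prob)
    expected = {odor: 0.0 for odor in odors}
    for _ in range(n_trials):
        odor = rng.choice(odors)
        # Reward arrives with the probability assigned to this odor.
        received = 1.0 if rng.random() < reward_prob[odor] else 0.0
        expected[odor] = update_expectation(
            expected[odor], received, learning_rate
        )
    return expected

# Reward probabilities follow the article's example: 80 and 20 percent.
expected = simulate({"OCT": 0.8, "MCH": 0.2})
print(expected)
```

When a reward arrives that was not fully expected, the number goes up; when an expected reward fails to arrive, it goes down; and when the outcome equals the expectation, nothing changes. Over many trials, each expectation tends to hover near the reward rate of its odor.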

The new work emphasizes the important interplay between experiment and theory, converging on a description of the rules governing how an animal learns – an outcome that the researchers say is satisfying on both a conceptual and mechanistic level.

“To be able to see that you can get these sophisticated economic decisions through this simple mechanistic explanation about how synapses are changing is a great illustration of what mechanistic cognitive neuroscience can mean,” Fitzgerald says. “We’re taking this universal property and using the strengths of these small animals to really nail it mechanistically.”

###

Citation:

Adithya E. Rajagopalan, Ran Darshan, Karen L. Hibbard, James E. Fitzgerald, and Glenn C. Turner. “Reward expectations direct learning and drive operant matching in Drosophila.” PNAS. Published online September 22, 2023. DOI: 10.1073/pnas.2221415120
