Connect with us

Science & Technology

Collaborative machine learning that preserves privacy

Researchers increase the accuracy and efficiency of a machine-learning method that safeguards user data

EP Staff



Written by Adam Zewe, MIT News Office

Training a machine-learning model to effectively perform a task, such as image classification, involves showing the model thousands, millions, or even billions of example images. Gathering such enormous datasets can be especially challenging when privacy is a concern, such as with medical images. Researchers from MIT and the MIT-born startup DynamoFL have now taken one popular solution to this problem, known as federated learning, and made it faster and more accurate.

Federated learning is a collaborative method for training a machine-learning model that keeps sensitive user data private. Hundreds or thousands of users each train their own model using their own data on their own device. Then users transfer their models to a central server, which combines them to come up with a better model that it sends back to all users.

A collection of hospitals located around the world, for example, could use this method to train a machine-learning model that identifies brain tumors in medical images, while keeping patient data secure on their local servers.

But federated learning has some drawbacks. Transferring a large machine-learning model to and from a central server involves moving a lot of data, which has high communication costs, especially since the model must be sent back and forth dozens or even hundreds of times. Plus, each user gathers their own data, so those data don’t necessarily follow the same statistical patterns, which hampers the performance of the combined model. And that combined model is made by taking an average — it is not personalized for each user.

The researchers developed a technique that can simultaneously address these three problems of federated learning. Their method boosts the accuracy of the combined machine-learning model while significantly reducing its size, which speeds up communication between users and the central server. It also ensures that each user receives a model that is more personalized for their environment, which improves performance.

The researchers were able to reduce the model size by nearly an order of magnitude when compared to other techniques, which led to communication costs that were between four and six times lower for individual users. Their technique was also able to increase the model’s overall accuracy by about 10 percent.

“A lot of papers have addressed one of the problems of federated learning, but the challenge was to put all of this together. Algorithms that focus just on personalization or communication efficiency don’t provide a good enough solution. We wanted to be sure we were able to optimize for everything, so this technique could actually be used in the real world,” says Vaikkunth Mugunthan PhD ’22, lead author of a paper that introduces this technique.

Mugunthan wrote the paper with his advisor, senior author Lalana Kagal, a principal research scientist in the Computer Science and Artificial Intelligence Laboratory (CSAIL). The work will be presented at the European Conference on Computer Vision.

Cutting a model down to size

The system the researchers developed, called FedLTN, relies on an idea in machine learning known as the lottery ticket hypothesis. This hypothesis says that within very large neural network models there exist much smaller subnetworks that can achieve the same performance. Finding one of these subnetworks is akin to finding a winning lottery ticket. (LTN stands for “lottery ticket network.”)

Neural networks, loosely based on the human brain, are machine-learning models that learn to solve problems using interconnected layers of nodes, or neurons.

Finding a winning lottery ticket network is more complicated than a simple scratch-off. The researchers must use a process called iterative pruning. If the model’s accuracy is above a set threshold, they remove nodes and the connections between them (just like pruning branches off a bush) and then test the leaner neural network to see if the accuracy remains above the threshold.

Other methods have used this pruning technique for federated learning to create smaller machine-learning models which could be transferred more efficiently. But while these methods may speed things up, model performance suffers.

Mugunthan and Kagal applied a few novel techniques to accelerate the pruning process while making the new, smaller models more accurate and personalized for each user.

They accelerated pruning by avoiding a step where the remaining parts of the pruned neural network are “rewound” to their original values. They also trained the model before pruning it, which makes it more accurate so it can be pruned at a faster rate, Mugunthan explains.

To make each model more personalized for the user’s environment, they were careful not to prune away layers in the network that capture important statistical information about that user’s specific data. In addition, when the models were all combined, they made use of information stored in the central server so it wasn’t starting from scratch for each round of communication.

They also developed a technique to reduce the number of communication rounds for users with resource-constrained devices, like a smart phone on a slow network. These users start the federated learning process with a leaner model that has already been optimized by a subset of other users.

Winning big with lottery ticket networks

When they put FedLTN to the test in simulations, it led to better performance and reduced communication costs across the board. In one experiment, a traditional federated learning approach produced a model that was 45 megabytes in size, while their technique generated a model with the same accuracy that was only 5 megabytes. In another test, a state-of-the-art technique required 12,000 megabytes of communication between users and the server to train one model, whereas FedLTN only required 4,500 megabytes.

With FedLTN, the worst-performing clients still saw a performance boost of more than 10 percent. And the overall model accuracy beat the state-of-the-art personalization algorithm by nearly 10 percent, Mugunthan adds.

Now that they have developed and finetuned FedLTN, Mugunthan is working to integrate the technique into a federated learning startup he recently founded, DynamoFL.

Moving forward, he hopes to continue enhancing this method. For instance, the researchers have demonstrated success using datasets that had labels, but a greater challenge would be applying the same techniques to unlabeled data, he says.

Mugunthan is hopeful this work inspires other researchers to rethink how they approach federated learning.

“This work shows the importance of thinking about these problems from a holistic aspect, and not just individual metrics that have to be improved. Sometimes, improving one metric can actually cause a downgrade in the other metrics. Instead, we should be focusing on how we can improve a bunch of things together, which is really important if it is to be deployed in the real world,” he says.

Science & Technology

Passive cooling system could benefit off-grid locations

Relying on evaporation and radiation — but not electricity — the system could keep food fresh longer or supplement air conditioning in buildings

EP Staff



Written by David L. Chandler, MIT News Office

As the world gets warmer, the use of power-hungry air conditioning systems is projected to increase significantly, putting a strain on existing power grids and bypassing many locations with little or no reliable electric power. Now, an innovative system developed at MIT offers a way to use passive cooling to preserve food crops and supplement conventional air conditioners in buildings, with no need for power and only a small need for water.

The system, which combines radiative cooling, evaporative cooling, and thermal insulation in a slim package that could resemble existing solar panels, can provide up to about 19 degrees Fahrenheit (9.3 degrees Celsius) of cooling from the ambient temperature, enough to permit safe food storage for about 40 percent longer under very humid conditions. It could triple the safe storage time under dryer conditions.

The findings are reported in the journal Cell Reports Physical Science, in a paper by MIT postdoc Zhengmao Lu, Arny Leroy PhD ’21, professors Jeffrey Grossman and Evelyn Wang, and two others. While more research is needed in order to bring down the cost of one key component of the system, the researchers say that eventually such a system could play a significant role in meeting the cooling needs of many parts of the world where a lack of electricity or water limits the use of conventional cooling systems.

The system cleverly combines previous standalone cooling designs that each provide limited amounts of cooling power, in order to produce significantly more cooling overall — enough to help reduce food losses from spoilage in parts of the world that are already suffering from limited food supplies. In recognition of that potential, the research team has been partly supported by MIT’s Abdul Latif Jameel Water and Food Systems Lab.

“This technology combines some of the good features of previous technologies such as evaporative cooling and radiative cooling,” Lu says. By using this combination, he says, “we show that you can achieve significant food life extension, even in areas where you have high humidity,” which limits the capabilities of conventional evaporative or radiative cooling systems.

In places that do have existing air conditioning systems in buildings, the new system could be used to significantly reduce the load on these systems by sending cool water to the hottest part of the system, the condenser. “By lowering the condenser temperature, you can effectively increase the air conditioner efficiency, so that way you can potentially save energy,” Lu says.

Other groups have also been pursuing passive cooling technologies, he says, but “by combining those features in a synergistic way, we are now able to achieve high cooling performance, even in high-humidity areas where previous technology generally cannot perform well.”

The system consists of three layers of material, which together provide cooling as water and heat pass through the device. In practice, the device could resemble a conventional solar panel, but instead of putting out electricity, it would directly provide cooling, for example by acting as the roof of a food storage container. Or, it could be used to send chilled water through pipes to cool parts of an existing air conditioning system and improve its efficiency. The only maintenance required is adding water for the evaporation, but the consumption is so low that this need only be done about once every four days in the hottest, driest areas, and only once a month in wetter areas.

The top layer is an aerogel, a material consisting mostly of air enclosed in the cavities of a sponge-like structure made of polyethylene. The material is highly insulating but freely allows both water vapor and infrared radiation to pass through. The evaporation of water (rising up from the layer below) provides some of the cooling power, while the infrared radiation, taking advantage of the extreme transparency of Earth’s atmosphere at those wavelengths, radiates some of the heat straight up through the air and into space — unlike air conditioners, which spew hot air into the immediate surrounding environment.

Below the aerogel is a layer of hydrogel — another sponge-like material, but one whose pore spaces filled with water rather than air. It’s similar to material currently used commercially for products such as cooling pads or wound dressings. This provides the water source for evaporative cooling, as water vapor forms at its surface and the vapor passes up right through the aerogel layer and out to the environment.

Below that, a mirror-like layer reflects any incoming sunlight that has reached it, sending it back up through the device rather than letting it heat up the materials and thus reducing their thermal load. And the top layer of aerogel, being a good insulator, is also highly solar-reflecting, limiting the amount of solar heating of the device, even under strong direct sunlight.

“The novelty here is really just bringing together the radiative cooling feature, the evaporative cooling feature, and also the thermal insulation feature all together in one architecture,” Lu explains. The system was tested, using a small version, just 4 inches across, on the rooftop of a building at MIT, proving its effectiveness even during suboptimal weather conditions, Lu says, and achieving 9.3 C of cooling (18.7 F).

“The challenge previously was that evaporative materials often do not deal with solar absorption well,” Lu says. “With these other materials, usually when they’re under the sun, they get heated, so they are unable to get to high cooling power at the ambient temperature.”

The aerogel material’s properties are a key to the system’s overall efficiency, but that material at present is expensive to produce, as it requires special equipment for critical point drying (CPD) to remove solvents slowly from the delicate porous structure without damaging it. The key characteristic that needs to be controlled to provide the desired characteristics is the size of the pores in the aerogel, which is made by mixing the polyethylene material with solvents, allowing it to set like a bowl of Jell-O, and then getting the solvents out of it. The research team is currently exploring ways of either making this drying process more inexpensive, such as by using freeze-drying, or finding alternative materials that can provide the same insulating function at lower cost, such as membranes separated by an air gap.

While the other materials used in the system are readily available and relatively inexpensive, Lu says, “the aerogel is the only material that’s a product from the lab that requires further development in terms of mass production.” And it’s impossible to predict how long that development might take before this system can be made practical for widespread use, he says.

The research team included Lenan Zhang of MIT’s Department of Mechanical Engineering and Jatin Patil of the Department of Materials Science and Engineering.

Continue Reading

Science & Technology

Scientists identify a plant molecule that sops up iron-rich heme

The peptide is used by legumes to control nitrogen-fixing bacteria; it may also offer leads for treating patients with too much heme in their blood

EP Staff



Written by Anne Trafton, MIT News Office

Symbiotic relationships between legumes and the bacteria that grow in their roots are critical for plant survival. Without those bacteria, the plants would have no source of nitrogen, an element that is essential for building proteins and other biomolecules, and they would be dependent on nitrogen fertilizer in the soil. 

To establish that symbiosis, some legume plants produce hundreds of peptides that help bacteria live within structures known as nodules within their roots. A new study from MIT reveals that one of these peptides has an unexpected function: It sops up all available heme, an iron-containing molecule. This sends the bacteria into an iron-starvation mode that ramps up their production of ammonia, the form of nitrogen that is usable for plants.

“This is the first of the 700 peptides in this system for which a really detailed molecular mechanism has been worked out,” says Graham Walker, the American Cancer Society Research Professor of Biology at MIT, a Howard Hughes Medical Institute Professor, and the senior author of the study.

This heme-sequestering peptide could have beneficial uses in treating a variety of human diseases, the researchers say. Removing free heme from the blood could help to treat diseases caused by bacteria or parasites that need heme to survive, such as P. gingivalis (periodontal disease) or toxoplasmosis, or diseases such as sickle cell disease or sepsis that release too much heme into the bloodstream.

“This study demonstrates that basic research in plant-microbe interactions also has potential to be translated to therapeutic applications,” says Siva Sankari, an MIT research scientist and the lead author of the study, which appears today in Nature Microbiology.

Other authors of the paper include Vignesh Babu, an MIT research scientist; Kevin Bian and Mary Andorfer, both MIT postdocs; Areej Alhhazmi, a former KACST-MIT Ibn Khaldun Fellowship for Saudi Arabian Women scholar; Kwan Yoon and Dante Avalos, MIT graduate students; Tyler Smith, an MIT instructor in biology; Catherine Drennan, an MIT professor of chemistry and biology and a Howard Hughes Medical Institute investigator; Michael Yaffe, a David H. Koch Professor of Science and a member of MIT’s Koch Institute for Integrative Cancer Research; and Sebastian Lourido, the Latham Family Career Development Professor of Biology at MIT and a member of the Whitehead Institute for Biomedical Research.

Iron control

For nearly 40 years, Walker’s lab has been studying the symbiosis between legumes and rhizobia, a type of nitrogen-fixing bacteria. These bacteria convert nitrogen gas to ammonia, a critical step of the Earth’s nitrogen cycle that makes the element available to plants (and to animals that eat the plants).

Most of Walker’s work has focused on a clover-like plant called Medicago truncatula. Nitrogen-fixing bacteria elicit the formation of nodules on the roots of these plants and eventually end up inside the plant cells, where they convert to their symbiotic form called bacteroids.

Several years ago, plant biologists discovered that Medicago truncatula produces about 700 peptides that contribute to the formation of these bacteroids. These peptides are generated in waves that help the bacteria make the transition from living freely to becoming embedded into plant cells where they act as nitrogen-fixing machines.

Walker and his students picked one of these peptides, known as NCR247, to dig into more deeply. Initial studies revealed that when nitrogen-fixing bacteria were exposed to this peptide, 15 percent of their genes were affected. Many of the genes that became more active were involved in importing iron.

The researchers then found that when they fused NCR247 to a larger protein, the hybrid protein was unexpectedly reddish in color. This serendipitous observation led to the discovery that NCR247 binds heme, an organic ring-shaped iron-containing molecule that is an important component of hemoglobin, the protein that red blood cells use to carry oxygen.

Further studies revealed that when NCR247 is released into bacterial cells, it sequesters most of the heme in the cell, sending the cells into an iron-starvation mode that triggers them to begin importing more iron from the external environment.

“Usually bacteria fine-tune their iron metabolism, and they don’t take up more iron when there is already enough,” Sankari says. “What’s cool about this peptide is that it overrides that mechanism and indirectly regulates the iron content of the bacteria.”

Nitrogenase, the main enzyme that bacteria use to fix nitrogen, requires 24 to 32 atoms of iron per enzyme molecule, so the influx of extra iron likely helps those enzymes to become more active, the researchers say. This influx is timed to coincide with nitrogen fixation, they found.

“These peptides are produced in a wave in the nodules, and the production of this particular peptide is higher when the bacteria are preparing to fix nitrogen. If this peptide was secreted throughout the whole process, then the cell would have too much iron all the time, which is bad for the cell,” Sankari says.

Without the NCR247 peptide, Medicago truncatula and rhizobium cannot form an effective nitrogen-fixing symbiosis, the researchers showed.

“Many possible directions”

The peptide that the researchers studied in this work may have potential therapeutic uses. When heme is incorporated into hemoglobin, it performs a critical function in the body, but when it’s loose in the bloodstream, it can kill cells and promote inflammation. Free heme can accumulate in stored blood, so having a way to filter out the heme before the blood is transfused into a patient could be potentially useful.

A variety of human diseases lead to free heme circulating in the bloodstream, including sickle cell anemia, sepsis, and malaria. Additionally, some infectious parasites and bacteria depend on heme for their survival but cannot produce it, so they scavenge it from their environment. Treating such infections with a protein that takes up all available heme could help prevent the parasitic or bacterial cells from being able to grow and reproduce.

In this study, Lourido and members of his lab showed that treating the parasite Toxoplasma gondii with NCR427 prevented the parasite from forming plaques on human cells.

The researchers are now pursuing collaborations with other labs at MIT to explore some of these potential applications, with funding from a Professor Amar G. Bose Research Grant.

“There are many possible directions, but they’re all at a very early stage,” Walker says. “The number of potential clinical applications is very broad. You can place more than one bet in this game, which is an intriguing thing.”

Currently, the human protein hemopexin, which also binds to heme, is being explored as a possible treatment for sickle cell anemia. The NCR247 peptide could provide an easier to deploy alternative, the researchers say, because it is much smaller and could be easier to manufacture and deliver into the body.

The research was funded in part by the MIT Center for Environmental Health Sciences, the National Science Foundation, and the National Institutes of Health.

Continue Reading

Science & Technology

Measuring the “woodwork effect” in medical insurance

Study: When adults gain access to Medicaid, they sign up their previously unenrolled kids, too — yet many more remain outside the system

EP Staff



Written by Peter Dizikes, MIT News Office

Not everyone who qualifies for health insurance signs up for it. Consider Medicaid, the national health insurance plan for low-income people. Across the U.S., about 14 percent of eligible adults and 7 percent of eligible children are not enrolled in Medicaid.

As it happens, when adults do enroll in Medicaid, some of them sign up their eligible children for it, too. This is an example of a “woodwork effect,” as policy analysts have termed it — sometimes, people eligible for social programs may come out of the woodwork, as it were, to claim benefits.

A new study led by an MIT economist quantifies this effect, using Oregon as a case study. The research shows that for every nine adults who gained access to Medicaid in Oregon due to a special enrollment lottery, one previously eligible child was added to the Medicaid rolls as well.

But while the findings show that woodwork effects exist in social insurance, in Oregon the effect was not large enough to create major pressures on its Medicaid system, which is jointly funded by the federal and state governments. Most of the eligible children who were not already enrolled in Medicaid remained unenrolled; only about 6 percent of those who could have become enrolled were signed up when an adult in their household won the Oregon lottery.

“We find evidence of these woodwork effects,” says Amy Finkelstein, a professor in MIT’s Department of Economics and co-author of a new paper detailing the results. “We reject the hypothesis that these types of spillovers don’t occur. On the other hand, relative to claims in the media and in some previous work about potentially large woodwork effects, in excess of half of the direct effect … our effects are quantitatively much smaller than what was conjectured.”

The paper, “Out of the Woodwork: Enrollment Spillovers in the Oregon Health Insurance Experiment,” appears in the American Economic Journal: Economic Policy. The paper’s co-authors are Adam Sacarny PhD ’14, an assistant professor at the Columbia University Mailman School of Public Health; Katherine Baicker, the dean and the Emmett Dedmon Professor at the University of Chicago Harris School of Public Policy; and Finkelstein, the John and Jennie S. MacDonald Professor of Economics at MIT.

Winning the insurance lottery

To conduct the research, the scholars used data from the Oregon Health Insurance Experiment of 2008, a unique project conducted by the state of Oregon. Given enough funding to allow some Medicaid expansion to low-income, uninsured adults, Oregon ran a lottery for new Medicaid entry, receiving about 90,000 applications for 10,000 new slots.

That formed the basis of a useful experiment: Because those who win and lose the lottery do so at random, scholars can compare what subsequently happens to those who do and do not win the lottery to determine the effects of obtaining health insurance. Finkelstein, Baicker, and other colleagues have published several studies based on the Oregon lottery, showing that having Medicaid increases health care use, reduces out-of-pocket spending and medical debt, and lowers incidence of depression, among other things.

Because the children of the adults who participated in the lottery in Oregon were already eligible for Medicaid, the lottery allowed the researchers to ask: If adults obtain Medicaid, does that make them more likely to enroll their children, too?

“This enabled us to look at the question of what happens to children of adults who win the lottery, compared to children of adults who don’t win the lottery,” Finkelstein says. “We were just trying to get a sense of whether there were impacts on the children, and how large these effects were.”

The effect was real, but modest in size and shrank over time. A year onward from the lottery, the enrollment difference among children from lottery-winning and lottery-losing households was about one-third its initial size; some lottery-winning adults saw the enrollment status of their children lapse, while some children of adults who lost the lottery ended up eventually enrolling in Medicaid.

“The magnitude of the effect is economically and practically meaningful, but the effect is fairly short-lived,” Sacarny observes.

The findings add information to a public discussion that arose after the Affordable Care Act (ACA) was signed into law by President Barack Obama in 2010. The ACA allowed states to expand Medicaid to additional low-income adults, although many states did not. Some observers suggested that woodwork effects on children’s enrollment might greatly increase the taxpayer cost of the adult Medicaid expansion. The current study suggests these costs may be modest.

As Finkelstein notes, however, the current study is simply intended to inform the public discussion over Medicaid and woodwork effects, and to produce better estimates about insurance expansion.

“Whether you think previously eligible children enrolling in Medicaid when their parents become eligible is an extra benefit or an add-on cost of adult Medicaid expansion depends on your views of the costs and benefits of public health insurance,” Finkelstein says. In any case, Finkelstein observes, the cost of covering children through Medicaid is roughly four times smaller than the cost of covering adults.

“From a budget perspective, children tend to be much cheaper to cover than adults,” Finkelstein says. “They have lower health expenditures.”

Understanding the barriers to enrollment

The current paper also adds to an existing literature about the barriers to enrollment for health insurance and other social programs. There are a variety of reasons why people who are eligible for social programs may not enroll: They may not know they are eligible, may find the process too complicated, or may feel there is a stigma associated with such programs.

Finkelstein, for her part, has studied this issue too. Along with MIT economics colleagues Abhijit Banerjee and Benjamin Olken, among other scholars, she co-authored a paper last year about an experiment designed to encourage people to enroll people in Indonesia’s national health insurance program. The study found some benefit from subsidies and enrollment assistance, but no apparent benefit to the simple provision of information to people.

As Sacarny points out, the present study also demonstrates the many ways that randomized trials, like Oregon’s, can be used to generate further findings. Given a valid experiment, scholars can think creatively about how to identify its effects, and keep leveraging that experiment to produce rigorous results.

“This research highlights the value of conducting further secondary studies of randomized trials,” Sacarny says. “What we’re showing here is that when you link trials with additional administrative data, you can use them to study additional, potentially really important questions for economic and social policy.”

The current paper may also be the last one Finkelstein works on that derives from the Oregon Health Insurance Experiment of 2008; she has co-authored at least eight other refereed papers exploring the effects of Medicaid enrollment on people, work that has gained wide attention and helped inform the public discussion about health insurance.

“For me, it’s a bit of an end of an era,” Finkelstein says. However, she and her colleagues have developed a public use data file so that other researchers can dig into all the data from Oregon, and potentially surface additional findings as well.

The study was supported, in part, by the National Institute on Aging.

Continue Reading