Packing the car for a road trip might seem like a straightforward enough task, but it’s never been an easy one for robots to learn—until a new study turned the robot training over to artificial intelligence. The implications of this research go far beyond a well-packed trunk and could eventually impact things ranging from how we manage our homes to how we colonize Mars.
Using a form of generative AI known as a “diffusion model,” a team of researchers at the Massachusetts Institute of Technology and Stanford University trained robots to pack items into a limited space while adhering to a range of constraints: human concerns such as making sure that heavier items didn’t crush lighter ones, that some items had a certain amount of space between them, that a robot’s arm didn’t accidentally strike the container and damage it, and so on. The diffusion model helped the robots accomplish this faster than training methods used in the past, the researchers say.
“We want to have a learning-based method to solve constraints quickly because learning-based [AI] will solve faster, compared to traditional methods,” says M.I.T. Ph.D. student Zhutian “Skye” Yang, lead author of a paper detailing the study, which was recently released ahead of peer review on preprint server arXiv.org. A “learning-based” approach involves allowing an AI program to learn autonomously by identifying patterns between training data and the desired output. This differs from previously tested “rule-based” programs, which are more limited as they must behave within a strictly coded set of regulations. “The diffusion model is a very good method for sampling different solutions to a problem and jointly satisfying all of the constraints,” Yang says.
Autonomous packing “has been a challenging problem,” says Animesh Garg, an assistant professor of AI robotics at the Georgia Institute of Technology, who was not involved in the new study but works in a similar research area. “Without machine learning, the solution involves computationally intensive online 3-D bin packing”—a rule-based technique that “can even be unsolvable” depending on a program’s coded limitations.
Previously, for a robot to solve a packing problem within the aforementioned constraints, it would have to work sequentially. It would develop possible packing configurations and test each against one constraint at a time, then check for conflicts with the other constraints. This trial-and-error method proved too slow, especially when there were more items to pack—and therefore more actions to test. In the new study, the diffusion model, on the other hand, allowed a robot to simultaneously explore an array of machine-learning models, each representing an individual constraint. The sum total of these models afforded the robot a more thorough view of the problem, enabling it to consider all constraints at once, almost instantaneously. As a result, many more successful packing configurations were found faster than they had been with previous techniques. The study’s diffusion method also proved capable of solving new combinations of constraints that were applied to a larger number of items—beyond what the model experienced during training.
“Packing with robots is incredibly hard yet transformational,” Garg says. “This work enables robots to start ‘thinking’ on the fly and achieve very good, if not optimal, solutions quickly.”
“It’s a type of optimization problem,” Yang says. “With the learning-based method, we're happy to see that if we train on the small problems, it can generalize to solving problems with a larger number of objects or a larger set of constraints.”
The study team also looked at how its learning algorithm aligned with—or diverged from—most people’s intuition about how to pack. Humans “have heuristics of packing things to the edge first,” Yang says. “If you have a lot of things, you always pack them to the bottom left-hand side. Or if you are stacking things, you place things evenly, layer by layer, instead of all the way up one side and then the other.” While these heuristics may seem logical from a human perspective, learning-based robots without our preconceptions are free to discover novel solutions.
But by analyzing data ahead of time and keeping likely end solutions in mind before you start packing, you eliminate the need for trial and error. To pack multiple objects into a limited space—think a car trunk or a suitcase—like one of the study’s AI-powered robots, there are three steps. First, ponder ahead of time what you know about packing and what constraints must be met. Next, imagine solutions before you start loading objects. And finally, pack toward that ideal solution, not necessarily by following your intuition.
“There could be many solutions” that may not be intuitive, Yang says. “And you can change the plan as you go.”
Robots gaining an ability to pack faster and more efficiently than their human counterparts has applications far beyond road trips. “I want to have robots in the kitchen helping with housework,” Yang explains. “I just went to an industry robotic company to give a talk, and they are very interested in using this algorithm to pack for their customers.” For instance, she suggests the technique could help shipping companies pack disparate items into a single container or drug companies deliver a wide variety of medications to hospitals in bulk. The possibilities even transcend the planet. “If you’re going to Mars, you can have a robot decide how best to pack the resources,” Yang suggests.
Garg agrees the implications may be far-reaching. “Robotic packing and placement will enable a very large set of open-world robotic skills,” he says. More studies are needed, however. “This work has very impressive results, but it is still a few steps from considering the problem ‘solved,’” Garg says. “I hope that this work will galvanize the community to make quick progress in this domain.”
Now the team at M.I.T. and Stanford is working to make its robots even more capable at making “discrete decisions.” This involves not only teaching a robot to pack within constraints but also training it to do so within continuously shifting variables—for example, when tasked with packing items while simultaneously moving through a room.
So the next time you’re packing, consider doing it like a robot to optimize results. Before long, you might simply leave it entirely up to the machines.