Robots that explore at random are more reliable

Northwestern University engineers have developed a new AI algorithm, Maximum Diffusion Reinforcement Learning (MaxDiff RL), designed specifically for smart robotics. The algorithm encourages robots to explore their environments as randomly as possible so that they gain a diverse set of experiences, which yields higher-quality data and faster learning. In simulation, robots using MaxDiff RL consistently outperformed other state-of-the-art models, often learning and performing new tasks in a single attempt.

The research, led by Thomas Berrueta, a Ph.D. candidate at Northwestern, and Todd Murphey, a robotics expert and professor at Northwestern's McCormick School of Engineering, focused on developing an algorithm that ensures robots can collect high-quality data on their own. MaxDiff RL commands robots to move more randomly, so that they encounter the varied experiences they need to acquire skills and accomplish useful tasks. Because the robots curate these random experiences themselves, they learn more efficiently.
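
To make the intuition concrete, here is a minimal toy sketch in Python. It is not MaxDiff RL and is not taken from the study; it only illustrates, under simplified assumptions, why an agent that moves more randomly tends to gather more diverse data. The function name `explore`, the `randomness` parameter, and the square grid world are all hypothetical choices made for this illustration.

```python
import random


def explore(grid_size, steps, randomness, seed=0):
    """Toy illustration only (NOT the MaxDiff RL algorithm).

    An agent walks on a grid_size x grid_size grid. With probability
    `randomness` it picks a uniformly random move; otherwise it repeats
    one habitual move (+x). Returns the fraction of cells visited.
    """
    rng = random.Random(seed)  # seeded so the run is repeatable
    moves = [(1, 0), (-1, 0), (0, 1), (0, -1)]
    x = y = 0
    visited = {(x, y)}
    for _ in range(steps):
        if rng.random() < randomness:
            dx, dy = rng.choice(moves)  # random exploration
        else:
            dx, dy = moves[0]           # habitual action: step in +x
        # Clamp to the grid so the agent stops at the walls.
        x = min(max(x + dx, 0), grid_size - 1)
        y = min(max(y + dy, 0), grid_size - 1)
        visited.add((x, y))
    return len(visited) / (grid_size * grid_size)


if __name__ == "__main__":
    for r in (0.0, 0.5, 1.0):
        print(f"randomness={r:.1f} coverage={explore(10, 2000, r):.2f}")
```

With no randomness, the agent in this sketch only ever sees the single row it starts in, so whatever it learns is based on a narrow slice of its world. A fully random walker reaches far more of the grid in the same number of steps, which is the data-diversity effect the researchers describe, here in a deliberately simplified form.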

When tested against current state-of-the-art models in computer simulations, robots using MaxDiff RL learned faster and more consistently, often succeeding at tasks in a single attempt. The algorithm’s success lies in its ability to improve the quality of data collected and enable reliable decision-making in smart robotics, essential for applications like self-driving cars, delivery drones, household assistants, and automation. With MaxDiff RL, robots can generalize what they learn and apply it to new situations more effectively.

The new algorithm addresses a core challenge in training embodied AI systems such as robots, which must collect their own data without human curation. Traditional algorithms that depend on large quantities of training data and on trial and error are poorly suited to robotics, where a single failure can have catastrophic consequences. MaxDiff RL aims to bridge that gap by having robots gather thorough, diverse data about their environments through designed randomness and self-curated experiences, ultimately improving their reliability and performance.

The study, supported by the U.S. Army Research Office and the U.S. Office of Naval Research, points to uses for MaxDiff RL well beyond robotic vehicles. Its faster learning, greater agility, and generalization of skills could also benefit stationary robots, such as robotic arms in kitchens, as well as robots operating in more complex physical environments. By addressing foundational issues in smart robotics, MaxDiff RL paves the way for more reliable decision-making in AI systems.
