Waymo has pulled back the curtain on valuable datasets to help researchers better hone self-driving algorithms.

Jamie Davies

August 21, 2019

2 Min Read
Waymo opens-up data treasure trove for autonomous vehicles

Waymo has pulled back the curtain on valuable datasets to help researchers better hone self-driving algorithms.

While this is a nice gesture from the team, we suspect the lid will be kept shut on further datasets unless the idea becomes more mainstream. Data is king in the world of autonomous vehicles and this could prove to be a valuable bonanza for researchers and application developers throughout the world.

Waymo has said the datasets are not available for commercial use, though researchers in commercial organizations are free to access the data for their own development purposes.

“When it comes to research in machine learning, having access to data can turn an idea into a real innovation,” the team said in a Medium post.

“This data has the potential to help researchers make advances in 2D and 3D perception, and progress on areas such as domain adaptation, scene understanding and behaviour prediction. We hope that the research community will generate more exciting directions with our data that will not only help to make self-driving vehicles more capable, but also impact other related fields and applications, such as computer vision and robotics.”

When you look at the development of autonomous vehicles, nothing is more valuable than the right data, and those who collect it are usually very protective. Part of the reason for this is the effort which must be exerted to collect it, with companies like Waymo clocking up millions of miles on the road.

This release contains data from 1,000 driving segments, each capturing 20 seconds of continuous driving, corresponding to 200,000 frames at 10 Hz per sensor. Each segment contains sensor data from five high-resolution Waymo lidars and five front-and-side-facing cameras, offering a 360° view, as well as a total of 12 million 3D labels and 1.2 million 2D labels.

Such data would allow researchers to train models to track and predict the behaviour of other road users, as well as simulate certain situations to find the most appropriate outcome. The dataset covers various environments, from dense urban to suburban landscapes, as well as during day and night, at dawn and dusk, in sunshine and rain.

What is worth noting, as while this is the largest release of data for autonomous vehicles, it is not the first. Lyft released data last month, and Argo AI did so the month before.

The more data which is released to researchers, the quicker the autonomous dream can be realised, and the safer the final product will actually be. It does technically lessen the commercial edge of these organizations, but the final goal of getting autonomous vehicles on the road sooner rather than later seems to be more valuable.

You May Also Like