TY - JOUR
T1 - SubjectDrive
T2 - 39th Annual AAAI Conference on Artificial Intelligence, AAAI 2025
AU - Huang, Binyuan
AU - Wen, Yuqing
AU - Zhao, Yucheng
AU - Hu, Yaosi
AU - Liu, Yingfei
AU - Jia, Fan
AU - Mao, Weixin
AU - Wang, Tiancai
AU - Zhang, Chi
AU - Chen, Chang Wen
AU - Chen, Zhenzhong
AU - Zhang, Xiangyu
N1 - Publisher Copyright:
Copyright © 2025, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.
PY - 2025/4/11
Y1 - 2025/4/11
N2 - Autonomous driving progress relies on large-scale annotated datasets. In this work, we explore the potential of generative models to produce vast quantities of freely-labeled data for autonomous driving applications and present SubjectDrive, the first model proven to scale generative data production in a way that could continuously improve autonomous driving applications. We investigate the impact of scaling up the quantity of generative data on the performance of downstream perception models and find that enhancing data diversity plays a crucial role in effectively scaling generative data production. Therefore, we have developed a novel model equipped with a subject control mechanism, which allows the generative model to leverage diverse external data sources for producing varied and useful data. Extensive evaluations confirm SubjectDrive’s efficacy in generating scalable autonomous driving training data, marking a significant step toward revolutionizing data production methods in this field.
AB - Autonomous driving progress relies on large-scale annotated datasets. In this work, we explore the potential of generative models to produce vast quantities of freely-labeled data for autonomous driving applications and present SubjectDrive, the first model proven to scale generative data production in a way that could continuously improve autonomous driving applications. We investigate the impact of scaling up the quantity of generative data on the performance of downstream perception models and find that enhancing data diversity plays a crucial role in effectively scaling generative data production. Therefore, we have developed a novel model equipped with a subject control mechanism, which allows the generative model to leverage diverse external data sources for producing varied and useful data. Extensive evaluations confirm SubjectDrive’s efficacy in generating scalable autonomous driving training data, marking a significant step toward revolutionizing data production methods in this field.
UR - https://www.scopus.com/pages/publications/105003978134
U2 - 10.1609/aaai.v39i4.32376
DO - 10.1609/aaai.v39i4.32376
M3 - Conference article
AN - SCOPUS:105003978134
SN - 2159-5399
VL - 39
SP - 3617
EP - 3625
JO - Proceedings of the AAAI Conference on Artificial Intelligence
JF - Proceedings of the AAAI Conference on Artificial Intelligence
IS - 4
Y2 - 25 February 2025 through 4 March 2025
ER -