3D visual perception tasks, including 3D detection and map segmentation from multi-camera images, are essential for autonomous driving systems. In this work, we present a new framework termed BEVFormer, which learns unified bird's-eye-view (BEV) representations with spatiotemporal transformers to support multiple autonomous driving perception tasks. In a nutshell, BEVFormer exploits both spatial and temporal information by interacting with spatial and temporal space through predefined grid-shaped BEV queries. To aggregate spatial information, we design a spatial cross-attention in which each BEV query extracts spatial features from its regions of interest across camera views. For temporal information, we propose a temporal self-attention that recurrently fuses the history BEV information. Our approach achieves a new state-of-the-art 56.9% NDS on the nuScenes test set, which is 9.0 points higher than the previous best method and on par with LiDAR-based baselines.
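
The recurrent temporal fusion described above can be illustrated with a toy sketch. This is not the project's implementation: it swaps in a dense two-key softmax per grid cell, and it assumes each query attends to itself and to the history BEV feature at the same cell. The names temporal_self_attention and run_sequence are hypothetical, and the per-frame "observation" is only a stand-in for the output of spatial cross-attention.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a short list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def temporal_self_attention(queries, prev_bev):
    """Fuse each BEV query with the history feature at the same grid cell.

    queries, prev_bev: H x W grids of C-dim feature lists.
    prev_bev is None for the first frame, so each query attends only to itself.
    Assumption: dense softmax attention over two keys, a simplification.
    """
    fused = []
    for i, row in enumerate(queries):
        fused_row = []
        for j, q in enumerate(row):
            keys = [q] if prev_bev is None else [q, prev_bev[i][j]]
            scale = math.sqrt(len(q))
            weights = softmax([dot(q, k) / scale for k in keys])
            fused_row.append([
                sum(w * k[c] for w, k in zip(weights, keys))
                for c in range(len(q))
            ])
        fused.append(fused_row)
    return fused


def run_sequence(queries, observations):
    """Carry the BEV map across frames: fuse with history, then add new evidence."""
    prev_bev = None
    for obs in observations:
        fused = temporal_self_attention(queries, prev_bev)
        # Stand-in for spatial cross-attention: add this frame's observation.
        prev_bev = [
            [[f + o for f, o in zip(f_cell, o_cell)]
             for f_cell, o_cell in zip(f_row, o_row)]
            for f_row, o_row in zip(fused, obs)
        ]
    return prev_bev
```

The point of the sketch is the data flow: the BEV map produced at one frame becomes the history input for the next, so information from earlier frames is carried forward without stacking every past frame.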

Features

  • Cutting-edge Baseline for Camera-based Detection
  • In this work, the authors present a new framework termed BEVFormer
  • BEVFormer exploits both spatial and temporal information
  • The proposed approach achieves a new state-of-the-art 56.9% NDS on the nuScenes test set
  • To aggregate spatial information, the authors design a spatial cross-attention in which each BEV query extracts spatial features from its regions of interest across camera views
  • On par with the performance of LiDAR-based baselines
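
As a rough illustration of the spatial cross-attention idea in the list above, the sketch below lets one BEV grid cell gather features only from the camera views it projects into. Everything here is an assumption made for illustration: the toy pinhole camera at the ego origin, the dictionary layout, the pillar heights, nearest-pixel sampling in place of learned attention, and the names project, make_camera, and spatial_cross_attention.

```python
import math


def project(point, cam):
    """Project an ego-frame point (x forward, y left, z up) into a toy camera.

    Returns an integer (row, col) pixel, or None if the point lies behind
    the camera or outside the image.
    """
    x, y, z = point
    depth = x * math.cos(cam["yaw"]) + y * math.sin(cam["yaw"])
    if depth <= 1e-6:
        return None
    right = x * math.sin(cam["yaw"]) - y * math.cos(cam["yaw"])
    u = cam["focal"] * right / depth + cam["cols"] / 2
    v = cam["focal"] * (cam["mount_z"] - z) / depth + cam["rows"] / 2
    if 0 <= u < cam["cols"] and 0 <= v < cam["rows"]:
        return int(v), int(u)
    return None


def make_camera(yaw, value, rows=20, cols=20):
    """Hypothetical camera whose 1-channel feature map is constant `value`."""
    features = [[[value] for _ in range(cols)] for _ in range(rows)]
    return {"yaw": yaw, "focal": 10.0, "mount_z": 1.5,
            "rows": rows, "cols": cols, "features": features}


def spatial_cross_attention(bev_xy, cams, pillar_heights=(0.5, 1.5)):
    """Average features over the views that a BEV cell's pillar projects into."""
    channels = len(cams[0]["features"][0][0])
    per_view = []
    for cam in cams:
        samples = []
        for z in pillar_heights:  # lift the 2D cell to a few 3D reference points
            pixel = project((bev_xy[0], bev_xy[1], z), cam)
            if pixel is not None:
                samples.append(cam["features"][pixel[0]][pixel[1]])
        if samples:  # this view is "hit" by the query
            per_view.append([sum(s[c] for s in samples) / len(samples)
                             for c in range(channels)])
    if not per_view:
        return [0.0] * channels  # no camera sees this cell
    return [sum(v[c] for v in per_view) / len(per_view)
            for c in range(channels)]
```

For example, with a front camera (yaw 0) and a rear camera (yaw pi), a cell 5 m ahead of the ego vehicle draws features only from the front view, while a cell covered by two overlapping cameras receives the mean of both views.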

License

Apache License V2.0

Additional Project Details

Programming Language

Python

Related Categories

Python Frameworks, Python Machine Learning Software, Python Autonomous Driving Software, Python LiDAR Software

Registered

2022-08-22