1. 📘 Topic and Domain: A foundation monocular depth estimation model called DepthAnything-AC for handling diverse environmental conditions in computer vision and depth estimation.
2. 💡 Previous Research and New Ideas: Based on previous foundation MDE models like Depth Anything series that work well in general scenes but struggle with complex conditions; proposes new unsupervised consistency regularization and spatial distance constraint approaches.
3. ❓ Problem: Existing foundation MDE models perform poorly in complex real-world environments involving challenging lighting, weather conditions, and sensor distortions, while also struggling with boundary delineation and detail preservation.
4. 🛠️ Methods: Uses perturbation-based consistency framework to generate consistent predictions under different corruptions, and spatial distance constraint to enforce geometric relationships between patches; fine-tuned on 540K unlabeled images with various augmentations.
5. 📊 Results and Evaluation: Outperformed state-of-the-art approaches across multiple benchmarks including DA-2K, real-world adverse weather datasets, and synthetic corruption benchmarks, while maintaining performance on general scenes; showed particular improvements in boundary definition and detail preservation.