A MULTI-TASK VISION TRANSFORMER FOR SEGMENTATION AND MONOCULAR DEPTH ESTIMATION FOR AUTONOMOUS VEHICLES