If employees are ready to come back to the office, the technology for a healthier office is here to help teams collaborate and build culture face to face.
TL;DR: We find that current self-supervised learning approaches suffer from poor visual grounding and receive improper supervisory signals when trained on complex scene images. We introduce CAST to improve visual grounding during…