Scale-Adaptive Video Understanding.