Stories with Tag Vision-Language-Action Models