Learning, Detection, Representation, Indexing And Retrieval Of Multi-Agent Events In Videos