Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
The encoder learns image representations and combines them with object queries (each object query is a learned embedding that focuses on a region or object in an image) in the decoder.