Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
raw
history blame
158 Bytes
For the last K timesteps, each of the three modalities are converted into token embeddings and processed by a GPT-like model to predict a future action token.