I’m working on building a RAG system but ran into an issue with storing vectors at the token level for retrieval. I’m also curious about how to implement late interaction. Do you have any suggestions or sources that could help? Thanks in advance!