Spaces:

Ahmadzei
/

RAG

Runtime error

App Files Files Community

RAG / chunked /content_aware_chunking /_model_summary /chunk_15.txt

Ahmadzei

update 1

57bdca5 over 1 year ago

raw

history blame

371 Bytes

DeBERTa added a disentangled attention mechanism where the word and its position are separately encoded in two vectors. The attention is computed from these separate vectors instead of a single vector containing the word and position embeddings. Longformer also focused on making attention more efficient, especially for processing documents with longer sequence lengths.