harsha163 committed on
Commit
80d8c3c
1 Parent(s): 6db14fd

fixed grammar

Files changed (1)
  1. README.md +1 -1
README.md CHANGED
@@ -11,7 +11,7 @@ The full credit goes to: [Aritra Roy Gosthipaty](https://twitter.com/ariG23498),
  
  ## Short description:
  
- ViT and other Transformer based architectures represent the images as patches. As we increase the resolution of the images, the number of patches increase as well. To tackle this, Ryoo et al. introduced a new module called TokenLearner which can help reduce the number of patches used. The full paper can be found [here](https://openreview.net/forum?id=z-l1kpDXs88)
+ ViT and other Transformer based architectures break down images into patches. As we increase the resolution of the images, the number of patches increases as well. To tackle this, Ryoo et al. introduced a new module called TokenLearner which can help reduce the number of patches used. The full paper can be found [here](https://openreview.net/forum?id=z-l1kpDXs88)
  
  ## Model and Dataset used
  
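
As a rough illustration of the description touched by this commit, the sketch below counts how many patches a ViT-style model would produce at a few square resolutions. The 16-pixel patch size and the figure of 8 learned tokens are assumptions chosen for illustration, not values taken from this README, and `num_patches` is a hypothetical helper rather than part of any library.

```python
def num_patches(height: int, width: int, patch_size: int = 16) -> int:
    """Count non-overlapping square patches in an image.

    Assumes the image dimensions divide evenly by patch_size.
    """
    if height % patch_size or width % patch_size:
        raise ValueError("image size must be divisible by patch_size")
    return (height // patch_size) * (width // patch_size)


# Assumed token budget for a TokenLearner-style module (illustrative only).
LEARNED_TOKENS = 8

for side in (224, 384, 512):
    patches = num_patches(side, side)
    print(f"{side}x{side}: {patches} patches -> {LEARNED_TOKENS} learned tokens")
```

The patch count grows quadratically with the side length, while the learned-token budget in this sketch stays fixed, which is the reduction the description refers to.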