Update README.md
    	
README.md CHANGED
    
@@ -61,10 +61,9 @@ model. You can prompt the model for completions on the command line too:

61   ```
62
63   This model has a max context window size of 128k tokens. By default, a
64 - context window size of 512 tokens is used. You
65 -
66 -
67 - and it incurs significantly less memory overhead.
68
69   On GPUs with sufficient RAM, the `-ngl 999` flag may be passed to use
70   the system's NVIDIA or AMD GPU(s). On Windows, only the graphics card

61   ```
62
63   This model has a max context window size of 128k tokens. By default, a
64 + context window size of 512 tokens is used. You can use a larger context
65 + window by passing the `-c 8192` flag. The software currently has
66 + limitations that may prevent scaling to the full 128k size.
67
68   On GPUs with sufficient RAM, the `-ngl 999` flag may be passed to use
69   the system's NVIDIA or AMD GPU(s). On Windows, only the graphics card
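
As a rough illustration of how the two flags in this hunk combine on one command line, the Python sketch below assembles an argument list. Only the `-c` and `-ngl` flags and their example values (`8192`, `999`) come from the README text. The executable path `./model-binary`, the helper name `build_command`, and the reading of "128k" as 128 * 1024 tokens are assumptions made for the example.

```python
import shlex

# Assumption: "128k tokens" is read here as 128 * 1024; the README does not say which.
MAX_CONTEXT_TOKENS = 128 * 1024


def build_command(executable, context_tokens=None, offload_to_gpu=False):
    """Build an argv list for a hypothetical model executable.

    context_tokens=None leaves out -c, so the documented default of
    512 tokens applies. offload_to_gpu=True appends `-ngl 999`.
    """
    argv = [executable]
    if context_tokens is not None:
        if not 0 < context_tokens <= MAX_CONTEXT_TOKENS:
            raise ValueError(
                f"context_tokens must be between 1 and {MAX_CONTEXT_TOKENS}"
            )
        argv += ["-c", str(context_tokens)]
    if offload_to_gpu:
        argv += ["-ngl", "999"]
    return argv


if __name__ == "__main__":
    # "./model-binary" is a placeholder, not a name taken from the README.
    command = build_command(
        "./model-binary", context_tokens=8192, offload_to_gpu=True
    )
    print(shlex.join(command))
```

The upper bound only reflects the model's stated maximum. Per the new README wording, the software may not scale to the full 128k size, so a value that passes this check can still fail at run time.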

