Commits · THUDM/chatglm-6b-int4

Update README.md

826ca34
verified

yuxiaod commited on Aug 4, 2024

Merge branch 'main' of https://huggingface.co/THUDM/chatglm-6b-int4

6c5205c

duzx16 commited on Jul 8, 2023

Update license

bb09de3

duzx16 commited on Jul 8, 2023

Upload pytorch_model.bin

02a065c

zxdu20 commited on May 15, 2023

Update slack link

e214c5b

zxdu20 commited on May 12, 2023

Update decode method in tokenizer

d8a6cfc

duzx16 commited on May 9, 2023

Add support for parallel quantization on Mac

f6b88da

duzx16 commited on May 4, 2023

Remove assert in load_cpu_kernel

63d66b0

duzx16 commited on Apr 29, 2023

Sync with chatglm-6b

f55a108

duzx16 commited on Apr 28, 2023

Remove pytorch_model.bin.index.json

e02ba89

duzx16 commited on Apr 17, 2023

Update slack link

6498797

duzx16 commited on Apr 17, 2023

Add pytorch_model.bin.index.json

1e40d96

duzx16 commited on Apr 16, 2023

Add assertion when loading cpu and cuda kernel fails

630d0ef

songxxzp commited on Apr 14, 2023

Add assertion when loading cpu and cuda kernel fails

bcc35f0

songxxzp commited on Apr 14, 2023

Merge branch 'dev'

fe0674f

songxxzp commited on Apr 14, 2023

Update CPU kernel loading method

c7d8998

songxxzp commited on Apr 14, 2023

Fix gmask

3485994

duzx16 commited on Apr 14, 2023

Add empty_init option

9333486

duzx16 commited on Apr 13, 2023

Update README.md

6466cdc

duzx16 commited on Apr 13, 2023

Fix eos token in tokenizer

9163f7e

duzx16 commited on Apr 11, 2023

Update dependency

649466f

duzx16 commited on Apr 9, 2023

Fix attention score on mps

41fda88

duzx16 commited on Apr 9, 2023

Fix logit processor

a7272d4

duzx16 commited on Apr 8, 2023

Merge branch 'slim' of https://huggingface.co/THUDM/chatglm-6b-int4 into slim

96de7a2

duzx16 commited on Apr 7, 2023

Fix embedding quantization

5fc46d2

duzx16 commited on Apr 7, 2023

Upload pytorch_model.bin

7edbdfe

zxdu20 commited on Apr 7, 2023

Slim embedding

bfb1a8f

duzx16 commited on Apr 7, 2023

Fix bugs when compiling cpu kernels

68873da

DrSong commited on Apr 6, 2023

Drop icetk dependency

1f34060

duzx16 commited on Apr 6, 2023

Fix position ids expand

19685a5

duzx16 commited on Apr 3, 2023

Synchronize with chatglm 6b repo

7aaf3fe

DrSong commited on Apr 3, 2023

Fix parallel cpu kernel

7458231

DrSong commited on Apr 1, 2023

Fix bugs in quantization when loading kernels

dac03c3

DrSong commited on Mar 22, 2023

Fix Chinese punctuation

debaf00

duzx16 commited on Mar 22, 2023

Update README.md

3ba9437

Sengxian commited on Mar 20, 2023

Update README.md

0d0e806

Sengxian commited on Mar 20, 2023

Update README.md

7ad727c

Sengxian commited on Mar 20, 2023

init commmit

a93efa9

Sengxian commited on Mar 19, 2023

initial commit

62a9758

Zhengxiao Du commited on Mar 19, 2023

Commit History

Update README.md 826ca34 verified

Merge branch 'main' of https://huggingface.co/THUDM/chatglm-6b-int4 6c5205c

Update license bb09de3

Upload pytorch_model.bin 02a065c

Update slack link e214c5b

Update decode method in tokenizer d8a6cfc

Add support for parallel quantization on Mac f6b88da

Remove assert in load_cpu_kernel 63d66b0

Sync with chatglm-6b f55a108

Remove pytorch_model.bin.index.json e02ba89

Update slack link 6498797

Add pytorch_model.bin.index.json 1e40d96

Add assertion when loading cpu and cuda kernel fails 630d0ef

Add assertion when loading cpu and cuda kernel fails bcc35f0

Merge branch 'dev' fe0674f

Update CPU kernel loading method c7d8998

Fix gmask 3485994

Add empty_init option 9333486

Update README.md 6466cdc

Fix eos token in tokenizer 9163f7e

Update dependency 649466f

Fix attention score on mps 41fda88

Fix logit processor a7272d4

Merge branch 'slim' of https://huggingface.co/THUDM/chatglm-6b-int4 into slim 96de7a2

Fix embedding quantization 5fc46d2

Upload pytorch_model.bin 7edbdfe

Slim embedding bfb1a8f

Fix bugs when compiling cpu kernels 68873da

Drop icetk dependency 1f34060

Fix position ids expand 19685a5

Synchronize with chatglm 6b repo 7aaf3fe

Fix parallel cpu kernel 7458231

Fix bugs in quantization when loading kernels dac03c3

Fix Chinese punctuation debaf00

Update README.md 3ba9437

Update README.md 0d0e806

Update README.md 7ad727c

init commmit a93efa9

initial commit 62a9758

Update README.md

826ca34
verified

Merge branch 'main' of https://huggingface.co/THUDM/chatglm-6b-int4

6c5205c

Update license

bb09de3

Upload pytorch_model.bin

02a065c

Update slack link

e214c5b

Update decode method in tokenizer

d8a6cfc

Add support for parallel quantization on Mac

f6b88da

Remove assert in load_cpu_kernel

63d66b0

Sync with chatglm-6b

f55a108

Remove pytorch_model.bin.index.json

e02ba89

Update slack link

6498797

Add pytorch_model.bin.index.json

1e40d96

Add assertion when loading cpu and cuda kernel fails

630d0ef

Add assertion when loading cpu and cuda kernel fails

bcc35f0

Merge branch 'dev'

fe0674f

Update CPU kernel loading method

c7d8998

Fix gmask

3485994

Add empty_init option

9333486

Update README.md

6466cdc

Fix eos token in tokenizer

9163f7e

Update dependency

649466f

Fix attention score on mps

41fda88

Fix logit processor

a7272d4

Merge branch 'slim' of https://huggingface.co/THUDM/chatglm-6b-int4 into slim

96de7a2

Fix embedding quantization

5fc46d2

Upload pytorch_model.bin

7edbdfe

Slim embedding

bfb1a8f

Fix bugs when compiling cpu kernels

68873da

Drop icetk dependency

1f34060

Fix position ids expand

19685a5

Synchronize with chatglm 6b repo

7aaf3fe

Fix parallel cpu kernel

7458231

Fix bugs in quantization when loading kernels

dac03c3

Fix Chinese punctuation

debaf00

Update README.md

3ba9437

Update README.md

0d0e806

Update README.md

7ad727c

init commmit

a93efa9

initial commit

62a9758