Lossless LLM compression for efficient GPU inference via dynamic-length float (arxiv.org)
116 points by CharlesW 2 hours ago | 30 comments
1116 points by CharlesW 2 hours ago | 30 comments
137 points by hyperbrainer an hour ago | 5 comments
2379 points by scalewithlee 6 hours ago | 221 comments
370 points by anerli 3 hours ago | 23 comments
4197 points by po 7 hours ago | 70 comments
522 points by ndrwnaguib an hour ago | 7 comments
678 points by bookofjoe 5 hours ago | 13 comments
7564 points by kwindla 5 hours ago | 544 comments
87 points by lermontov an hour ago | 1 comment
960 points by luu 12 hours ago | 10 comments
1017 points by Jerry2 2 hours ago | 1 comment
113 hours ago
126 points by todsacerdoti 4 hours ago | 0 comments
13175 points by jacobr1 7 hours ago | 145 comments
14230 points by NaOH 21 hours ago | 59 comments
1547 points by zdw 2 days ago | 5 comments
1631 points by todsacerdoti 2 days ago | 9 comments
17285 points by susam 17 hours ago | 79 comments
1877 points by rntn 4 hours ago | 43 comments
1919 points by janwilmake 5 hours ago | 1 comment
2016 points by solarmist an hour ago | 6 comments
2116 points by samclemens 14 hours ago | 0 comments
22252 points by gnabgib a day ago | 103 comments
2338 points by samclemens 14 hours ago | 17 comments
24172 points by mpweiher 14 hours ago | 161 comments
25746 points by ekiauhce a day ago | 541 comments
2638 points by ksec 4 hours ago | 1 comment
2722 points by hilux 3 hours ago | 13 comments
2890 points by godzie 6 hours ago | 14 comments
2968 points by luu 3 days ago | 42 comments
30