Jump to main content Jump to sidebar

Forums
Wiki

Log in
Sign up

/f/MachineLearning

[R] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models - Massachusetts Institute of Technology and NVIDIA Guangxuan Xiao et al - Enables INT8 for LLM bigger than 100B parameters including OPT-175B, BLOOM-176B and GLM-130B.

Submitted by Singularian2501 t3_z1b2rp on November 21, 2022 at 9:37 PM in MachineLearning

13 comments

55

Viewing a single comment thread. View all comments

[deleted] t1_ixc9v6m wrote on November 22, 2022 at 10:49 AM

[removed]

Permalink

1

0 points (+0, −0)

Short URL:

http://ec2-3-131-244-37.us-east-2.compute.amazonaws.com:9999/34834

MachineLearning

t5_2r3gv

Created October 1, 2022
Subscribe via RSS

Toolbox

Bans
Moderation log

Running Postmill