@n1ck-guo
Contributor

No description provided.

Signed-off-by: n1ck-guo <heng.guo@intel.com>
self.backend = format.split(":")[1] if ":" in format else None

if self.backend == "llm_compressor":
    self.support_schemes = ["MXFP4", "MXFP8", "NVFP4", "FPW8A16", "FP8_STATIC"]
Contributor

@wenhuach21 Oct 17, 2025

Decouple the backend from the format. The backends are auto-gptq, auto-awq, llm-compressor, and auto-round.
Each backend should be a class that declares its supported schemes and provides functions such as pack_layer and pack_model.

Contributor

This is needed because we are going to support layer-wise data types.
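
A minimal sketch of what the suggested decoupling could look like, for discussion only. The names `Backend`, `LLMCompressorBackend`, `register_backend`, `get_backend`, and `parse_format` are hypothetical and do not come from this PR. The scheme list is the one in the diff above, and the packing body is a placeholder rather than real packing logic. `"auto_round:llm_compressor"` is used purely as an example input string.

```python
from abc import ABC, abstractmethod
from typing import Dict, Optional, Tuple, Type

# Hypothetical registry mapping a backend name to its class.
_BACKENDS: Dict[str, Type["Backend"]] = {}


def register_backend(name: str):
    """Class decorator that records a backend class under `name`."""
    def wrap(cls):
        cls.name = name
        _BACKENDS[name] = cls
        return cls
    return wrap


class Backend(ABC):
    """Hypothetical base class: a backend owns its schemes and its packing logic."""
    name: str = ""
    support_schemes: Tuple[str, ...] = ()

    def check_scheme(self, scheme: str) -> None:
        if scheme not in self.support_schemes:
            raise ValueError(f"backend {self.name!r} does not support scheme {scheme!r}")

    @abstractmethod
    def pack_layer(self, layer_name: str, scheme: str) -> str:
        ...

    def pack_model(self, layer_schemes: Dict[str, str]) -> Dict[str, str]:
        # Layer-wise data types: each layer carries its own scheme,
        # and each scheme is validated against this backend.
        packed = {}
        for layer_name, scheme in layer_schemes.items():
            self.check_scheme(scheme)
            packed[layer_name] = self.pack_layer(layer_name, scheme)
        return packed


@register_backend("llm_compressor")
class LLMCompressorBackend(Backend):
    # Scheme list copied from the diff under review.
    support_schemes = ("MXFP4", "MXFP8", "NVFP4", "FPW8A16", "FP8_STATIC")

    def pack_layer(self, layer_name: str, scheme: str) -> str:
        # Placeholder: real packing would rewrite the layer's weights.
        return f"{layer_name}:{scheme}:packed-by-{self.name}"


def parse_format(fmt: str) -> Tuple[str, Optional[str]]:
    """Split 'format:backend' into (format, backend); backend is None if absent."""
    fmt_name, _, backend = fmt.partition(":")
    return fmt_name, backend or None


def get_backend(name: str) -> Backend:
    if name not in _BACKENDS:
        raise KeyError(f"unknown backend {name!r}")
    return _BACKENDS[name]()


# Example usage with per-layer schemes (layer names are illustrative).
fmt_name, backend_name = parse_format("auto_round:llm_compressor")
backend = get_backend(backend_name)
packed = backend.pack_model({"layers.0.q_proj": "MXFP4", "layers.0.k_proj": "FP8_STATIC"})
```

With this shape, the format string is parsed once, and the scheme check moves out of the `if self.backend == ...` branch into the backend class, so adding a backend means adding a class rather than another branch.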

@wenhuach21 marked this pull request as draft October 17, 2025 09:10
@wenhuach21
Contributor

Converting this to a draft for now due to a shortage of UT (unit test) machines.

3 participants