Popular repositories

  1. rmbg-1.4 Public template

    State-of-the-art background removal model, designed to effectively separate foreground from background. <metadata> gpu: T4 | collections: ["HF Transformers"] </metadata>

    Python 23 13

  2. triton-co-pilot Public

    Generate glue code in seconds to simplify your NVIDIA Triton Inference Server deployments.

    Python 20 3

  3. whisper-large-v3 Public template

    State-of-the-art speech recognition model for English, delivering accurate transcription across diverse audio scenarios. <metadata> gpu: T4 | collections: ["CTranslate2"] </metadata>

    Python 17 17

  4. smaug-72b Public

    Smaug-72B topped the Hugging Face LLM leaderboard and it’s the first model with an average score of 80, making it the world’s best open-source foundation model. <metadata> gpu: A100 | collections: …

    Python 17 4

  5. qwq-32b-preview Public template

    A 32B experimental reasoning model for advanced text generation and robust instruction following. <metadata> gpu: A100 | collections: ["vLLM"] </metadata>

    Python 17 7

  6. deepseek-r1-distill-qwen-32b Public template

    A distilled DeepSeek-R1 variant built on Qwen2.5-32B, fine-tuned with curated data for enhanced performance and efficiency. <metadata> gpu: A100 | collections: ["vLLM"] </metadata>

    Python 15 39

Repositories

Showing 10 of 186 repositories
  • chatterbox Public template

    Chatterbox is a TTS model by Resemble AI featuring emotion exaggeration control, zero-shot voice cloning, alignment-informed real-time synthesis, and built-in PerTh neural watermarking for responsible, high-quality speech generation. <metadata> gpu: A10 | collections: ["HF_Transformers"] </metadata>

    inferless/chatterbox’s past year of commit activity
    Python 0 3 0 0 Updated Aug 29, 2025
  • inferless/mistral-small-3.2-24b-instruct’s past year of commit activity
    Python 0 0 0 0 Updated Aug 29, 2025
  • qwen3-30b-a3b-instruct-2507 Public template

    30.5B MoE language model from the Qwen team, tuned for broad instruction following, reasoning, multilingual tasks, and agentic tool use. <metadata> gpu: A100 | collections: ["HF_Transformers"] </metadata>

    inferless/qwen3-30b-a3b-instruct-2507’s past year of commit activity
    Python 0 2 0 0 Updated Aug 29, 2025
  • 0 2 0 0 Updated Aug 29, 2025
  • flux-1-krea-dev Public template

    12B model distilled from Krea 1, designed to deliver highly photorealistic results. <metadata> gpu: A100 | collections: ["HF_Transformers"] </metadata>

    inferless/flux-1-krea-dev’s past year of commit activity
    Python 0 2 0 0 Updated Aug 29, 2025
  • inferless/code-debugging-agent’s past year of commit activity
    Python 0 0 0 0 Updated Aug 28, 2025
  • dia-1.6b Public
    inferless/dia-1.6b’s past year of commit activity
    0 0 0 0 Updated Aug 18, 2025
  • qwen-image Public
    inferless/qwen-image’s past year of commit activity
    Python 0 0 0 0 Updated Aug 14, 2025
  • pyannote-speaker-diarization-3.1 Public template

    A state-of-the-art model that segments and labels audio recordings by accurately distinguishing different speakers. <metadata> gpu: T4 | collections: ["HF Transformers"] </metadata>

    inferless/pyannote-speaker-diarization-3.1’s past year of commit activity
    Python 5 3 0 0 Updated Aug 14, 2025
  • facebook-bart-cnn Public template

    A variant of the BART model designed specifically for natural language summarization. It was pre-trained on a large corpus of English text and later fine-tuned on the CNN/Daily Mail dataset. <metadata> gpu: T4 | collections: ["HF Transformers"] </metadata>

    inferless/facebook-bart-cnn’s past year of commit activity
    Python 10 3 0 1 Updated Aug 14, 2025
