Description
Proposed Content
I would like to propose adding strands-deepgram to the Community Packages documentation.
This is a production-ready speech and audio processing tool for Strands Agents, powered by Deepgram's API. It enables agents to:
- Speech-to-Text: Transcribe audio files with 30+ language support and speaker diarization
- Text-to-Speech: Generate natural-sounding speech with multiple voice options
- Audio Intelligence: Sentiment analysis, topic detection, and intent recognition
- Multi-format Support: WAV, MP3, M4A, FLAC, and more
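As a rough illustration of the multi-format point, a caller might check a file's extension before asking an agent to transcribe it. The helper below is a hypothetical sketch and not part of strands-deepgram. Its extension set contains only the four formats named above, while the package reportedly accepts more, and the prompt wording is just one possible phrasing.

```python
from pathlib import Path

# Formats named in this proposal; assumption: the package may accept others too.
KNOWN_AUDIO_EXTENSIONS = {".wav", ".mp3", ".m4a", ".flac"}


def build_transcription_prompt(audio_path: str, diarize: bool = False) -> str:
    """Build a natural-language transcription request (hypothetical helper)."""
    suffix = Path(audio_path).suffix.lower()
    if suffix not in KNOWN_AUDIO_EXTENSIONS:
        raise ValueError(f"unrecognized audio extension: {suffix!r}")
    prompt = f"transcribe this audio: {audio_path}"
    if diarize:
        prompt += " with speaker diarization"
    return prompt
```

The returned string could then be passed to an agent that has the deepgram tool attached.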
Package Information:
- PyPI: https://pypi.org/project/strands-deepgram/
- GitHub: https://github.com/eraykeskinmac/strands-deepgram
- Examples: https://github.com/eraykeskinmac/strands-tools-examples
Installation:
pip install strands-deepgram
pip install 'strands-agents[anthropic]'
Quick Usage Example:
from strands import Agent
from strands_deepgram import deepgram
agent = Agent(tools=[deepgram])
# Transcribe with speaker identification
agent("transcribe this audio: recording.mp3 with speaker diarization")
# Text-to-speech
agent("convert this text to speech: Hello world")
# Audio intelligence
agent("analyze sentiment in call.wav")
Location
Community Packages → Tools → Speech & Audio Processing
This should be added to the "Community Packages" section of the documentation, specifically under the "Tools" category. It would fit alongside other community-contributed tools that extend Strands Agents capabilities.
Rationale
This package would be valuable for the Strands Agents community because:
- Fills a Critical Gap: Enables AI agents to process voice and audio data, which is essential for modern applications like call analytics, voice assistants, meeting transcriptions, and customer support automation.
- Production-Ready: Built with comprehensive error handling, follows Strands best practices, supports 30+ languages, and includes real-world examples. Ready to use in production environments.
- Saves Development Time: Developers can integrate speech processing in minutes instead of spending days building Deepgram integration from scratch.
- Complete Workflows: Works seamlessly with other community tools (strands-hubspot for CRM lookups, strands-teams for notifications) to create end-to-end workflows like call transcription → customer lookup → team notification.
- Community Benefit: Open-source with comprehensive documentation and examples. Other developers can learn from it, contribute to it, and build upon it.
- Following Best Practices: Published on PyPI, follows strands-{thing} naming convention, includes thorough documentation, and demonstrates proper tool implementation patterns.
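The end-to-end workflow mentioned under Complete Workflows (call transcription → customer lookup → team notification) can be pictured as three chained steps. This is a minimal sketch using stand-in stub functions. None of the names below come from strands-deepgram, strands-hubspot, or strands-teams. In practice an agent with those tools attached would decide the sequencing itself.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class CallSummary:
    transcript: str
    customer: str
    notified: bool


def run_call_workflow(
    audio_path: str,
    transcribe: Callable[[str], str],
    lookup_customer: Callable[[str], str],
    notify_team: Callable[[str], bool],
) -> CallSummary:
    """Chain the three steps; each callable is a placeholder for a real tool."""
    transcript = transcribe(audio_path)
    customer = lookup_customer(transcript)
    notified = notify_team(f"Call with {customer}: {transcript}")
    return CallSummary(transcript, customer, notified)


# Hypothetical stubs so the sketch runs without API keys or network access.
def fake_transcribe(path: str) -> str:
    return f"[transcript of {path}]"


def fake_lookup(transcript: str) -> str:
    return "Example Customer"


def fake_notify(message: str) -> bool:
    return True


summary = run_call_workflow("call.wav", fake_transcribe, fake_lookup, fake_notify)
```

Passing the steps in as callables keeps the sketch independent of any particular tool's signature, which is not documented in this proposal.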
Content Outline (Optional)
- PyPI Package: https://pypi.org/project/strands-deepgram/
- GitHub Repository: https://github.com/eraykeskinmac/strands-deepgram
- Examples Repository: https://github.com/eraykeskinmac/strands-tools-examples
- Deepgram API Documentation: https://developers.deepgram.com/
- Strands Community Package Guidelines: https://strandsagents.com/docs/community-packages
References
- PyPI Package: https://pypi.org/project/strands-deepgram/
- GitHub Repository: https://github.com/eraykeskinmac/strands-deepgram
- Examples Repository: https://github.com/eraykeskinmac/strands-tools-examples
- Deepgram API Documentation: https://developers.deepgram.com/
- Strands Community Package Guidelines: https://strandsagents.com/latest/documentation/docs/community/community-packages/