TranscriptionTools-MCP
by: MushroomFleet
An MCP server providing intelligent transcript processing capabilities, featuring natural formatting, contextual repair, and smart summarization powered by Deep Thinking LLMs.
Overview
Purpose: Provide advanced transcript processing through intelligent repair, formatting, and summarization.
Overview: The TranscriptionTools MCP Server harnesses Deep Thinking LLMs to deliver sophisticated transcript processing, including natural text formatting, contextual error correction, and smart summarization.
Key Features:
- Repair Text: Analyzes transcription errors and fixes those it can correct with over 90% confidence, so the text accurately represents the spoken content.
- Format Transcript: Converts timestamped transcripts into naturally flowing text, improving readability while preserving the original speech patterns.
- Get Repair Log: Retrieves the analysis logs from previous transcription repairs so the changes can be reviewed.
- Summary Text: Generates concise summaries of transcripts using ACE cognitive methodology, preserving key information and context under time, character, or word constraints.
TranscriptionTools MCP Server
Available MCP Tools
This MCP server exposes four powerful tools for transcript processing:
- repair_text - Analyzes and repairs transcription errors with greater than 90% confidence
- get_repair_log - Retrieves detailed analysis logs from previous repairs
- format_transcript - Transforms timestamped transcripts into naturally formatted text
- summary_text - Generates intelligent summaries using ACE cognitive methodology
Installation
Installing via Smithery
To install Transcription Tools for Claude Desktop automatically via Smithery:
npx -y @smithery/cli install @MushroomFleet/transcriptiontools-mcp --client claude
Manual Installation
- Clone this repository:
git clone https://github.com/mushroomfleet/TranscriptionTools-MCP
cd TranscriptionTools-MCP
- Install dependencies:
npm install
- Build the server:
npm run build
- Configure the MCP server in your MCP settings file:
{
  "mcpServers": {
    "transcription-tools": {
      "command": "node",
      "args": ["/path/to/TranscriptionTools-MCP/build/index.js"],
      "disabled": false,
      "autoApprove": []
    }
  }
}
Using the MCP Tools
Repairing Transcription Errors
<use_mcp_tool>
<server_name>transcription-tools</server_name>
<tool_name>repair_text</tool_name>
<arguments>
{
  "input_text": "We recieve about ten thousand dollars which is defiantly not enough.",
  "is_file_path": false
}
</arguments>
</use_mcp_tool>
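The `<use_mcp_tool>` block is the form some MCP clients accept in a prompt. On the wire, MCP clients send tool invocations as JSON-RPC 2.0 `tools/call` requests. The sketch below shows roughly what the equivalent request looks like for `repair_text`; the `buildToolCall` helper and the `ToolCallRequest` type are hypothetical names used only for illustration and are not part of this server.

```typescript
// Hypothetical helper: shapes a JSON-RPC 2.0 "tools/call" request of the kind
// MCP clients send. Names here are illustrative, not taken from this project.
interface ToolCallRequest {
  jsonrpc: "2.0";
  id: number;
  method: "tools/call";
  params: { name: string; arguments: Record<string, unknown> };
}

function buildToolCall(
  id: number,
  name: string,
  args: Record<string, unknown>
): ToolCallRequest {
  return {
    jsonrpc: "2.0",
    id,
    method: "tools/call",
    params: { name, arguments: args },
  };
}

// Same arguments as the repair_text example above.
const repairRequest = buildToolCall(1, "repair_text", {
  input_text: "We recieve about ten thousand dollars which is defiantly not enough.",
  is_file_path: false,
});

// Serialized message a client would write to the server's transport.
const wireMessage = JSON.stringify(repairRequest);
```

In practice an MCP client library builds and sends this message for you; the sketch is only meant to show how the tool name and arguments map onto the request.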
Formatting Timestamped Transcripts
<use_mcp_tool>
<server_name>transcription-tools</server_name>
<tool_name>format_transcript</tool_name>
<arguments>
{
  "input_text": "/path/to/timestamped-transcript.txt",
  "is_file_path": true,
  "paragraph_gap": 8,
  "line_gap": 4
}
</arguments>
</use_mcp_tool>
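The `is_file_path` flag in the examples decides whether `input_text` is treated as literal text or as a path to read from disk. A minimal sketch of that branching follows, assuming UTF-8 text files; `resolveInput` is a hypothetical helper and may not match what `src/utils/file-handler.ts` actually does.

```typescript
import { readFileSync } from "node:fs";

// Hypothetical helper, not the server's actual file-handling code.
// Returns the text to process: either the argument itself or the file contents.
function resolveInput(inputText: string, isFilePath: boolean): string {
  if (!isFilePath) {
    return inputText; // treat the argument as the transcript itself
  }
  // Assumption: transcripts are stored as UTF-8 text.
  return readFileSync(inputText, "utf8");
}

// Inline text is passed through unchanged.
const inlineText = resolveInput("Hello there.", false);
```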
Generating Summaries
<use_mcp_tool>
<server_name>transcription-tools</server_name>
<tool_name>summary_text</tool_name>
<arguments>
{
  "input_text": "Long text to summarize...",
  "is_file_path": false,
  "constraint_type": "words",
  "constraint_value": 100
}
</arguments>
</use_mcp_tool>
Retrieving Repair Logs
<use_mcp_tool>
<server_name>transcription-tools</server_name>
<tool_name>get_repair_log</tool_name>
<arguments>
{
  "session_id": "20241206143022"
}
</arguments>
</use_mcp_tool>
Core Technologies
Natural Formatting
- Removes timestamps while preserving speech patterns
- Applies intelligent spacing based on pause duration
- Respects natural grammar and language flow
- Maintains exact transcribed content
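One way to picture pause-based spacing, using the `paragraph_gap` and `line_gap` values from the `format_transcript` example, is sketched below. It is an illustration under stated assumptions and not the server's implementation: the `Segment` shape and the `formatByPauses` function are hypothetical, and whether a pause exactly equal to a gap counts as a break is a guess.

```typescript
// Hypothetical segment shape; the server's real parser and types may differ.
interface Segment {
  start: number; // seconds
  end: number; // seconds
  text: string;
}

// Joins segments with a space, a line break, or a paragraph break,
// depending on the silence between one segment and the next.
function formatByPauses(
  segments: Segment[],
  paragraphGap = 8, // seconds
  lineGap = 4 // seconds
): string {
  let out = "";
  let prevEnd: number | null = null;
  for (const seg of segments) {
    if (prevEnd !== null) {
      const pause = seg.start - prevEnd;
      // Assumption: a pause equal to the gap value triggers the break.
      if (pause >= paragraphGap) out += "\n\n";
      else if (pause >= lineGap) out += "\n";
      else out += " ";
    }
    out += seg.text.trim(); // text itself is left untouched
    prevEnd = seg.end;
  }
  return out;
}

const formatted = formatByPauses([
  { start: 0, end: 2, text: "Thanks for joining." },
  { start: 2.5, end: 5, text: "Let's get started." },
  { start: 10, end: 12, text: "First item." },
  { start: 21, end: 23, text: "Second topic." },
]);
// formatted === "Thanks for joining. Let's get started.\nFirst item.\n\nSecond topic."
```

The pauses in the usage example are 0.5 s, 5 s, and 9 s, so they produce a space, a line break, and a paragraph break respectively.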
Contextual Repair
- Identifies and corrects likely transcription errors
- Uses semantic context for high-confidence corrections
- Maintains detailed logs of all changes
- Applies a 90% confidence threshold to corrections
- No original audio required
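The confidence threshold can be read as a filter over candidate corrections: those above the threshold are applied, and the rest are left alone but still recorded. The sketch below illustrates that idea only. The `Candidate` and `RepairResult` types, the `applyRepairs` function, and the confidence scores in the usage example are all hypothetical; how the server actually produces and scores candidates is not described in this document.

```typescript
// Hypothetical shapes for illustration; not the server's real types.
interface Candidate {
  original: string;
  replacement: string;
  confidence: number; // percentage, 0 to 100
}

interface RepairResult {
  text: string;
  applied: Candidate[]; // corrections that were made
  skipped: Candidate[]; // candidates that were below the threshold or not found
}

function applyRepairs(
  text: string,
  candidates: Candidate[],
  threshold = 90
): RepairResult {
  const applied: Candidate[] = [];
  const skipped: Candidate[] = [];
  let result = text;
  for (const c of candidates) {
    // Assumption: confidence must be strictly greater than the threshold.
    if (c.confidence > threshold && result.includes(c.original)) {
      // Replaces the first occurrence only; the function form avoids
      // special "$" patterns in the replacement string.
      result = result.replace(c.original, () => c.replacement);
      applied.push(c);
    } else {
      skipped.push(c);
    }
  }
  return { text: result, applied, skipped };
}

// Confidence values below are made up for the example.
const repaired = applyRepairs(
  "We recieve about ten thousand dollars which is defiantly not enough.",
  [
    { original: "recieve", replacement: "receive", confidence: 98 },
    { original: "defiantly", replacement: "definitely", confidence: 95 },
    { original: "ten", replacement: "tens of", confidence: 40 },
  ]
);
// repaired.text === "We receive about ten thousand dollars which is definitely not enough."
```

Keeping the skipped candidates alongside the applied ones gives a log that shows both what changed and what was considered but rejected.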
Smart Summarization
- Creates concise summaries of processed transcripts
- Supports multiple constraint types:
  - Time-based (speaking duration)
  - Character count
  - Word count
- Preserves key information and context
- Maintains natural speaking rhythm
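The three constraint types can be reduced to a single length budget for the summarizer. The sketch below shows one possible mapping, using the 150 words-per-minute speaking pace listed under Configuration. Only `"words"` appears as a `constraint_type` value in this document; the `"time"` and `"characters"` names, the assumption that time is given in seconds, and the `toBudget` function itself are hypothetical.

```typescript
// Only "words" is confirmed by the usage example; the other two names are guesses.
type ConstraintType = "time" | "characters" | "words";

interface Budget {
  unit: "words" | "characters";
  limit: number;
}

// Converts a constraint into a length budget.
// Assumption: time-based constraints are expressed in seconds.
function toBudget(
  type: ConstraintType,
  value: number,
  speakingPaceWpm = 150
): Budget {
  switch (type) {
    case "time":
      // seconds -> minutes -> words at the given speaking pace
      return { unit: "words", limit: Math.floor((value / 60) * speakingPaceWpm) };
    case "characters":
      return { unit: "characters", limit: value };
    case "words":
      return { unit: "words", limit: value };
  }
}

const thirtySeconds = toBudget("time", 30); // { unit: "words", limit: 75 }
const hundredWords = toBudget("words", 100); // { unit: "words", limit: 100 }
```

At 150 words per minute, a 30-second summary works out to 75 words, which is the arithmetic the time-based branch performs.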
Project Structure
/
├── .gitignore
├── LICENSE
├── README.md
├── package.json
├── tsconfig.json
├── build/
│   ├── tools/
│   └── utils/
└── src/
    ├── index.ts
    ├── tools/
    │   ├── formatting.ts
    │   ├── repair.ts
    │   └── summary.ts
    └── utils/
        ├── file-handler.ts
        └── logger.ts
Configuration
You can customize the server behavior by modifying the source code directly. Key configuration parameters are found in the respective tool implementation files:
// In src/tools/formatting.ts
const paragraph_gap = 8; // seconds
const line_gap = 4; // seconds

// In src/tools/repair.ts
const confidence_threshold = 90; // percentage

// In src/tools/summary.ts
const default_speaking_pace = 150; // words per minute
License
MIT