Sarcouncil Journal of Multidisciplinary
Sarcouncil Journal of Multidisciplinary
An Open access peer reviewed international Journal
Publication Frequency- Monthly
Publisher Name-SARC Publisher
ISSN Online- 2945-3445
Country of origin- PHILIPPINES
Frequency- 3.6
Language- English
Keywords
- Social sciences, Medical sciences, Engineering, Biology
Editors

Dr Hazim Abdul-Rahman
Associate Editor
Sarcouncil Journal of Applied Sciences

Entessar Al Jbawi
Associate Editor
Sarcouncil Journal of Multidisciplinary

Rishabh Rajesh Shanbhag
Associate Editor
Sarcouncil Journal of Engineering and Computer Sciences

Dr Md. Rezowan ur Rahman
Associate Editor
Sarcouncil Journal of Biomedical Sciences

Dr Ifeoma Christy
Associate Editor
Sarcouncil Journal of Entrepreneurship And Business Management
Scalable MCP Server-Client Architecture with FastMCP in Microservices
Keywords: Micro service architecture, Context management, Token-based references, Distributed caching, Asynchronous APIs.
Abstract: Fast MCP represents a significant advancement in context management for micro service architectures, addressing the critical challenge of maintaining contextual integrity across service boundaries. The exponential growth of distributed computing has created an urgent need for efficient context propagation mechanisms, with traditional implementations struggling under high loads. Fast MCP leverages token-based referencing and Redis-backed persistence to dramatically reduce network payload sizes while maintaining high throughput and low latency. The architecture comprises four key components: Context Registry, Session Manager, Context Serializer, and Distribution Layer, working in concert to ensure context preservation while maintaining loose coupling. Implementation using Fast API provides an ideal asynchronous foundation for non-blocking operations, with endpoints optimized for creation, retrieval, and modification of context data. The tokenization approach generates compact, unique identifiers that reference context objects stored in Redis, significantly reducing network overhead and improving request/response times. Additionally, Fast MCP incorporates Server-Sent Events (SSE) and streaming HTTP capabilities to enable real-time context synchronization for Large Language Model (LLM) applications, with Redis serving as both a session tracker and streaming state manager for persistent conversational contexts. Performance testing confirms Fast MCP's substantial advantages in latency, throughput, resource utilization, and scalability across diverse deployment scenarios, from machine learning pipelines to transaction processing systems. The system demonstrates near-linear scalability for read operations when deployed in cluster configurations, with graceful degradation under extreme loads prioritizing availability over strict consistency.
Author
- Manmohan Alla
- Glasgow Caledonian University UK