Real-Time Voice/Media Engineer
7 days ago
Real-Time Voice/Media Engineer (Twilio & Latency)
Job Summary
Synechron seeks a proficient Real-Time Voice/Media Engineer with expertise in Twilio, WebRTC, and low-latency media streams to support telephony and voice applications. This role is essential for building and maintaining real-time communication pipelines that deliver high-quality audio with minimal latency, enabling seamless customer interactions and reliable voice solutions. The engineer will focus on optimizing media handling, telemetry, and orchestration, ensuring the delivery of efficient, scalable, and robust real-time voice features aligned with organizational goals.
Software Requirements
Required Skills:
-
Twilio PSTN & WebRTC: Expertise in integrating Twilio voice services including PSTN connectivity, WebRTC signaling, and media streams.
-
Media Stream Orchestration (WebSocket): Experience in orchestrating real-time media and signaling using WebSocket protocols for low-latency communication.
-
Real-Time Audio Pipelines: Design and implementation of high-performance audio pathways, including barge-in and turn-taking mechanisms.
-
Telemetry & Latency Monitoring: Implementing and analyzing telemetry for each stage of voice processing (ASR, LLM, TTS) to monitor and optimize latency.
-
Multilingual Speech Configurations: Experience configuring speech systems with support for Spanish and English, including handling speech recognition and synthesis.
Preferred Skills:
-
Familiarity with additional telephony platforms or media services beyond Twilio.
-
Knowledge of media encoding standards, codecs, and jitter buffer management.
Overall Responsibilities
-
Develop and optimize real-time voice and media streaming solutions using Twilio and WebRTC technologies.
-
Implement telemetry and monitoring tools to measure and reduce latency at various stages of voice processing, including speech recognition and synthesis.
-
Manage media orchestration and signaling flow using WebSocket channels to ensure smooth interactions, barge-in, and turn-taking functionality.
-
Collaborate with software engineering teams to integrate voice features into broader customer engagement solutions.
-
Maintain high-quality voice streams by troubleshooting issues related to latency, jitter, packet loss, and network disruptions.
-
Configure and tune telephony components to meet reliability, scalability, and security standards.
-
Continuously monitor system performance, providing insights and improvements for latency reduction and user experience enhancement.
-
Document configurations, protocols, and procedures for current and future reference.
Technical Skills (By Category)
Programming Languages:
-
Required: JavaScript/TypeScript (WebRTC signaling, front-end integrations), familiarity with scripting languages (e.g., Python, Bash) for automation.
-
Preferred: Experience with low-level media processing languages (C/C ).
Databases / Data Management:
-
Knowledge of logging and telemetry data storage in systems such as Elasticsearch or similar for real-time analysis.
Cloud Technologies:
-
Experience deploying and managing telephony and media solutions on cloud platforms, especially AWS or Azure.
Frameworks and Libraries:
-
WebRTC, Twilio SDKs, real-time WebSocket libraries (e.g., Socket.IO).
Development Tools & Methodologies:
-
Version control: Git, GitHub or Azure DevOps.
-
Agile development practices, continuous integration/deployment processes.
Security Protocols:
-
TLS encryption for signaling and media streams.
-
Ensuring compliance with telephony security standards and privacy policies.
Monitoring & Telemetry:
-
OpenTelemetry, Azure Monitor, Prometheus, or Grafana for observation and latency analytics.
Experience Requirements
-
At least 3-5 years of experience working with real-time voice communications, media streams, or telephony systems.
-
Proven experience configuring and managing Twilio Voice, WebRTC implementations, or similar platforms.
-
Experience monitoring, analyzing, and optimizing low-latency communication systems, including telemetry for latency measurement.
-
Familiarity with multilingual speech recognition and synthesis in commercial or custom environments is advantageous.
-
Prior work in contact center, customer engagement, or voice-enabled applications preferred.
Day-to-Day Activities
-
Developing and refining real-time media pipelines to enable low-latency voice communication.
-
Configuring and managing media signaling via WebSocket connections, ensuring efficient session management.
-
Building telemetry and monitoring dashboards to track latency, jitter, packet loss, and other performance metrics.
-
Troubleshooting audio quality issues, latency spikes, and media disruptions, implementing solutions promptly.
-
Collaborating with product teams and developers to integrate voice features and ensure quality standards.
-
Performing regular system tests, tuning buffer sizes, and adjusting configurations to optimize performance.
-
Documenting system architectures, protocols, and operational procedures.
-
Participating in backlog grooming, sprint planning, and review meetings to align technical development with business needs.
Qualifications
-
Educational: Bachelor's degree or higher in Computer Science, Electrical Engineering, Telecommunications, or related fields. Equivalent practical experience is acceptable.
-
Certifications: Relevant telephony, VoIP, or WebRTC certifications (e.g., Twilio Certified, WebRTC Developer) are a plus.
-
Training & Development: Commitment to ongoing learning in voice technology, latency optimization, and network security.
Professional Competencies
-
Analytical mindset with strong troubleshooting and problem-solving skills.
-
Ability to interpret telemetry data and translate insights into actionable improvements.
-
Excellent communication skills for technical collaboration and documentation.
-
Team-oriented approach with experience working across cross-functional teams.
-
Adaptability to evolving voice and conferencing technologies, with a focus on security and performance.
-
Ability to manage multiple priorities and deliver solutions within specified timelines.
SYNECHRON'S DIVERSITY & INCLUSION STATEMENT
Diversity & Inclusion are fundamental to our culture, and Synechron is proud to be an equal opportunity workplace and is an affirmative action employer. Our Diversity, Equity, and Inclusion (DEI) initiative 'Same Difference' is committed to fostering an inclusive culture – promoting equality, diversity and an environment that is respectful to all. We strongly believe that a diverse workforce helps build stronger, successful businesses as a global company. We encourage applicants from across diverse backgrounds, race, ethnicities, religion, age, marital status, gender, sexual orientations, or disabilities to apply. We empower our global workforce by offering flexible workplace arrangements, mentoring, internal mobility, learning and development programs, and more.
All employment decisions at Synechron are based on business needs, job requirements and individual qualifications, without regard to the applicant's gender, gender identity, sexual orientation, race, ethnicity, disabled or veteran status, or any other characteristic protected by law.
Candidate Application Notice
Experience LevelSenior Level-
Real-Time Voice/ Media Engineer
2 days ago
Pune, Maharashtra, India Synechron Full time ₹ 20,00,000 - ₹ 25,00,000 per yearSynechron seeks a proficient Real-Time Voice/Media Engineer with expertise in Twilio, WebRTC, and low-latency media streams to support telephony and voice applications. This role is essential for building and maintaining real-time communication pipelines that deliver high-quality audio with minimal latency, enabling seamless customer interactions and...
-
AI Voice Agent Engineer
1 week ago
Pune, India JobTalk AI Full timeJob Description We are building next-generation AI Voice Agents for recruiters. You'll design and implement real-time, low-latency conversational systems that combine LLMs, STT, TTS, RAG, and streaming pipelines. The backend runs in Python, with a React/Node.js frontend. If you have built real-time voice systems that can handle human interruptions and...
-
Speech AI Engineer
2 weeks ago
India The Future of Voice Full timeOverviewWe’re building the future of voice — intelligent, expressive, multilingual systems that can understand, respond, and connect with humans naturally.As our Speech AI Engineer, you’ll be part of a high-performance team designing voice intelligence that powers conversational platforms for governments, enterprises, and next-gen contact...
-
Speech AI Engineer
2 weeks ago
India The Future of Voice Full timeOverview We’re building the future of voice — intelligent, expressive, multilingual systems that can understand, respond, and connect with humans naturally. As our Speech AI Engineer , you’ll be part of a high-performance team designing voice intelligence that powers conversational platforms for governments, enterprises, and next-gen contact...
-
Speech AI Engineer
2 weeks ago
India The Future of Voice Full timeJob Description Overview We're building the future of voice intelligent, expressive, multilingual systems that can understand, respond, and connect with humans naturally. As ourSpeech AI Engineer, you'll be part of a high-performance team designingvoice intelligencethat powers conversational platforms for governments, enterprises, and next-gen contact...
-
AI Voice Agent Engineer
3 weeks ago
Pune, India JobTalk AI Full timeWe are building next-generation AI Voice Agents for recruiters . You’ll design and implement real-time, low-latency conversational systems that combine LLMs, STT, TTS, RAG , and streaming pipelines. The backend runs in Python , with a React/Node.js frontend. If you have built real-time voice systems that can handle human interruptions and dynamic context,...
-
AI Voice Agent Engineer
3 weeks ago
Pune, India JobTalk AI Full timeWe are building next-generation AI Voice Agents for recruiters . You’ll design and implement real-time, low-latency conversational systems that combine LLMs, STT, TTS, RAG , and streaming pipelines. The backend runs in Python , with a React/Node.js frontend. If you have built real-time voice systems that can handle human interruptions and dynamic context,...
-
AI Voice Agent Engineer
4 weeks ago
Pune, India JobTalk AI Full timeWe are building next-generation AI Voice Agents for recruiters . You’ll design and implement real-time, low-latency conversational systems that combine LLMs, STT, TTS, RAG , and streaming pipelines. The backend runs in Python , with a React/Node.js frontend. If you have built real-time voice systems that can handle human interruptions and dynamic context,...
-
AI Voice Agent Engineer
3 weeks ago
Pune, India JobTalk AI Full timeWe are building next-generation AI Voice Agents for recruiters. You’ll design and implement real-time, low-latency conversational systems that combine LLMs, STT, TTS, RAG, and streaming pipelines. The backend runs in Python, with a React/Node.js frontend. If you have built real-time voice systems that can handle human interruptions and dynamic context,...
-
Ai voice agent engineer
2 weeks ago
Pune, India JobTalk AI Full timeWe are building next-generation AI Voice Agents for recruiters. You’ll design and implement real-time, low-latency conversational systems that combine LLMs, STT, TTS, RAG, and streaming pipelines. The backend runs in Python, with a React/Node.js frontend. If you have built real-time voice systems that can handle human interruptions and dynamic context,...