Talk · the vault

Encrypted voice & video — with voice masking.

A phone call is the most natural way to talk and one of the easiest to intercept, record and identify by voice alone. Helix carries voice and global HD video over its own transport, triple-encrypted, and adds real-time voice masking that defeats automated voiceprint identification. Here's how it works — and exactly what voice masking can and can't do.

1. What encrypted calling actually means 2. How triple-encrypted calls work 3. The hard problem: real-time encryption 4. Global HD video on our own transport 5. Voice masking: defeating the voiceprint 6. The threats it stops 7. Why it matters to you 8. How Helix does it 9. The honest limits

1. What encrypted calling actually means

"Encrypted call" is one of the most abused phrases in tech. Plenty of services encrypt your call to their server — the link from your phone to their data center is protected, which stops the café Wi-Fi from listening in. But the provider's server in the middle can still hear everything, because the call is decrypted there to be routed onward. That is transport encryption, not end-to-end encryption, and it leaves the provider — and anyone who can compel or breach the provider — with a clear line to your conversation.

End-to-end encryption is the real thing: the audio and video are encrypted on your device and only decrypted on the device of the person you're talking to. Nobody in between — not the network, not the relays carrying the call, not the company that built the app — can listen. That is the standard Helix holds calls to, and then it goes further, because end-to-end encryption alone still rests on a single cipher and a single key exchange being unbroken. Helix doesn't make that bet.

The distinction is not academic. When a "private" calling feature turns out to be transport-encrypted only, the company holding the middle can be served with a court order, can be breached by an attacker who gets into its data center, can be acquired by a firm with different priorities, or can simply decide to analyze call data for its own purposes. None of those require breaking any cryptography — the plaintext was sitting on the provider's server by design. End-to-end encryption removes that server-side copy entirely: there is no point in the path where the call exists in a form anyone but the two participants can read. That is the difference between "encrypted so the Wi-Fi can't listen" and "encrypted so nobody but us can listen," and it is the difference Helix insists on.

2. How triple-encrypted calls work

Helix calls ride the same three-layer protection as its messaging, adapted for real-time audio. The audio stream is protected by:

The hard part with live audio is doing all of this fast enough that the call still sounds like a normal phone line. Heavy encryption that adds noticeable delay produces the awkward, talking-over-each-other experience that makes people abandon secure tools and reach for the insecure one. Helix is engineered so the protection is effectively invisible: the call connects, the screen confirms the exact moment it's secured end to end, and the audio is crystal-clear with the latency of an ordinary call. Security you can't feel is security you'll actually use.

3. The hard problem: real-time encryption

It is worth dwelling on why secure calling is harder than secure messaging, because it explains a lot of why so many "private" apps either skip calls or do them badly. A text message can take a moment to encrypt, route and deliver — a few hundred milliseconds of delay is imperceptible when you're reading. A live conversation is unforgiving. Human beings notice round-trip delays as small as a couple of hundred milliseconds; past roughly that point, people start talking over each other, pauses feel awkward, and the natural rhythm of a conversation collapses. Add encryption, multi-hop routing and any kind of metadata protection, and every one of those layers wants to add latency.

This is the tension at the heart of secure real-time comms: the protections that make a call private also tend to make it slow, and a slow call is one people abandon for the insecure alternative. The worst outcome in security is a tool so painful that its users route around it. Helix's answer is to engineer the encryption and transport specifically for real-time media — lean protocols, high-bandwidth relays, and a fast lane on the network that prioritizes low latency for live audio and video while still riding the encrypted transport. The audio is encoded, sealed under the triple layer, routed and decoded fast enough that the call feels ordinary. The security is real; the friction is not. That combination — strong protection that you genuinely can't feel — is the only kind that survives contact with daily use.

There is a subtle metadata benefit here too. Because the call rides Helix's own network rather than dialing out over the cellular voice system or a public conferencing service, there is no carrier call-detail record and no third-party log noting that a call happened between two parties at a given time. The fact of the call — not just its contents — stays inside the private network.

4. Global HD video on our own transport

Video raises the bar again. HD video is bandwidth-hungry and latency-sensitive, which is exactly why most "private" messengers either don't offer real video calling or quietly route it through a third-party conferencing service — handing your face, your surroundings and your call metadata to a company you never chose. That third party is a server that can be subpoenaed, a vendor that can be breached, and an SDK that can be mined.

Helix carries HD video to anyone on Helix worldwide over its own transport — the same private network that carries everything else — triple-encrypted per frame, with no conferencing third party in the path. Because the network is built for high bandwidth (the relays move traffic at up to 2.5 Gbit/s on the onion network's fast lane), HD video and big transfers arrive without the stutter people expect from "secure" tools. You get face-to-face, low-latency video that is genuinely private and genuinely usable, instead of choosing one or the other.

"Per frame" is a deliberate phrase. Video is a stream of individual frames; encrypting the stream as one opaque blob would be simpler but more brittle. By sealing the media continuously as it flows, the same forward-secrecy and post-compromise guarantees that protect a text conversation apply across the duration of the call — a key compromised at one instant does not retroactively unlock the minutes that came before, nor automatically the minutes that follow once the ratchet rolls past it. For a long, sensitive call, that means the call's secrecy is not one fragile lock but a continuously renewed one, the same self-healing principle Helix applies to messaging, carried into real-time media.

5. Voice masking: defeating the voiceprint

Encryption protects what you say. It does nothing about a different, increasingly common threat: being identified by how you sound. Your voice is a biometric. Automated speaker-recognition systems can extract a "voiceprint" — a mathematical model of your unique vocal characteristics — and use it to pick you out, confirm it's you on a line, or match you across recordings. Call centers, surveillance systems and fraud-detection platforms all do this routinely now, and the technology is cheap and widespread.

Helix includes real-time voice masking: it alters your voice as you speak so that automated systems cannot match it to a voiceprint. The processing happens live, on the call, so the person you're talking to hears a natural conversation while an automated identification system fed the same audio fails to find you in it. For anyone who needs to speak without their voice itself becoming an identifier, this closes a gap that encryption simply does not address — because the threat was never the content, it was the timbre.

This is the one feature where we have to be precise about its limit, because overstating it would be dangerous. Voice masking defeats automated voiceprint identification. It does not fool a human who already knows how you sound.

That distinction is the whole truth of the feature. A machine comparing your masked audio against a stored voiceprint will fail to match — the mathematical fingerprint it relies on has been altered out from under it. But a person who knows your voice — a colleague, a relative, an interrogator who has heard you before — may well recognize cadence, accent, word choice and manner that a voiceprint algorithm doesn't weigh the same way. Voice masking is a strong defense against the scalable, automated identification that watches everyone at once. It is not a disguise against someone who already knows you. We say so plainly because a tool that promised the latter would get someone hurt.

It helps to understand what an automated voiceprint system actually keys on. These systems don't store a recording of you saying a particular phrase; they extract statistical features of your vocal tract and speaking style — pitch range, the resonant frequencies your throat and mouth produce, the way energy is distributed across the spectrum — and reduce them to a compact mathematical signature. Real-time voice masking works by transforming those underlying acoustic features as you speak, shifting the signature far enough that it no longer lands near your stored voiceprint, while keeping the speech intelligible and natural to a human listener on the other end. The machine is comparing fingerprints; masking changes the fingerprint without changing the words.

6. The threats it stops

Putting it together, encrypted calls with voice masking defend against three distinct things, each a different kind of attacker:

And, importantly, it is honest about the threat it does not stop: a human who knows your voice. Naming that limit is part of making the tool safe to rely on.

7. Why it matters to you

Truly private calling — and the option to obscure your voice as a biometric — matters most to the people whose voice, face and the simple fact of a call are themselves sensitive:

8. How Helix does it

Voice and video are part of the Helix suite, riding the same private network and the same three-layer post-quantum encryption as messaging and files. There is no conferencing third party: HD video reaches anyone on Helix worldwide over Helix's own transport, encrypted per frame, with the screen confirming the moment the call is secured. Voice masking is available in real time on calls. The same infrastructure also carries the built-in VPN and the onion-routed metadata protection, so a private call isn't just encrypted content — the pattern of who called whom is protected too.

As with everything in Helix, calls assume the device underneath is also defended. A perfectly encrypted, voice-masked call is still readable to an attacker who has compromised the phone and is capturing the microphone before encryption — which is why Helix pairs calls with a device-level shield that watches for exactly that, and is honest that detection is a strong signal rather than a guarantee.

9. The honest limits

The most important limit here is the one we lead with, because getting it wrong could put someone at risk:

Within those honest boundaries, the goal is calling that sounds normal, looks sharp, stays genuinely between the two endpoints, and gives you the option to keep your voice from becoming a machine-readable identifier.

Encryption protects what you say. Voice masking protects how you sound — from automated voiceprint systems, not from someone who already knows your voice. We won't pretend otherwise.
Get Helix — from $199/moSee every feature