wz-phone

Author	SHA1	Message	Date
Siavash Sameni	27bc264738	feat(codec): Phase 3b — CallDecoder DRED reconstruction on packet loss Phase 3b of the DRED integration — wires the Phase 3a FFI primitives into the desktop receive path. When the jitter buffer reports a missing Opus frame, CallDecoder now attempts to reconstruct the audio from the most recently parsed DRED side-channel state before falling through to classical PLC. Architectural refinement vs the PRD's literal wording: the PRD said "jitter buffer takes a Box<dyn DredReconstructor>". After checking deps, wzp-transport depends only on wzp-proto (not wzp-codec). Putting DRED state in the jitter buffer would require a new cross-crate dep and couple the codec-agnostic buffer to libopus. Instead, this commit keeps the DRED state ring and reconstruction dispatch inside CallDecoder (one layer up from the jitter buffer), intercepting the existing PlayoutResult::Missing signal. Same lookahead/backfill semantics, cleaner layering, zero change to wzp-transport. Changes: CallDecoder field type: Box<dyn AudioDecoder> → AdaptiveDecoder. Required because Phase 3b calls the inherent reconstruct_from_dred method, which cannot live on the AudioDecoder trait without dragging libopus DredState through wzp-proto. In practice AdaptiveDecoder was the only AudioDecoder implementor anyway — the trait abstraction was buying nothing. Method call sites unchanged because AdaptiveDecoder also implements AudioDecoder. New CallDecoder fields: - dred_decoder: DredDecoderHandle - dred_parse_scratch: DredState (scratch for parse_into) - last_good_dred: DredState (cached most-recent valid state) - last_good_dred_seq: Option<u16> - dred_reconstructions: u64 (Phase 4 telemetry) - classical_plc_invocations: u64 (Phase 4 telemetry) CallDecoder::ingest — on Opus non-repair packets, parse DRED into the scratch state. On success (samples_available > 0), std::mem::swap the scratch into last_good_dred and record the seq. This is O(1) per packet, zero allocation after construction (the two DredState buffers are allocated once in new() and reused forever). CallDecoder::decode_next — on PlayoutResult::Missing(seq) for Opus profiles: if last_good_dred_seq > seq and the seq delta × frame_samples fits within samples_available, call audio_dec.reconstruct_from_dred and bump dred_reconstructions. Otherwise fall through to classical PLC and bump classical_plc_invocations. The Codec2 path always falls through to classical PLC since DRED is libopus-only and AdaptiveDecoder::reconstruct_from_dred rejects Codec2 tiers explicitly. OpusDecoder and AdaptiveDecoder: new inherent reconstruct_from_dred method that delegates to the underlying DecoderHandle. Needed to bridge CallDecoder's wzp-client code to the Phase 3a FFI wrappers without touching the AudioDecoder trait. CRITICAL FINDING — raised DRED loss floor from 5% to 15%: Phase 3b testing discovered that libopus 1.5's DRED emission window scales aggressively with OPUS_SET_PACKET_LOSS_PERC. Empirical data (see probe_dred_samples_available_by_loss_floor, an #[ignore]'d diagnostic test in call.rs): loss_pct samples_available effective_ms 5% 720 15 ms (useless!) 10% 2640 55 ms 15% 4560 95 ms 20% 6480 135 ms 25%+ 8400 (capped) 175 ms (~87% of 200 ms configured) The Phase 1 default of 5% produced only a 15 ms reconstruction window — too small to even cover a single 20 ms Opus frame. DRED was effectively disabled even though it was emitting bytes. Raised the floor to 15% (95 ms window) as the minimum that actually provides single-frame loss recovery. This updates Phase 1's DRED_LOSS_FLOOR_PCT constant in opus_enc.rs and the accompanying module docstring. Trade-off: 15% assumed loss slightly increases encoder bitrate overhead on clean networks. Measured via the existing phase1 bitrate probe: Before (5% floor): 3649 bytes/sec at Opus 24k + 300 Hz sine After (15% floor): 3568 bytes/sec at Opus 24k + 300 Hz sine The delta is within noise — 15% isn't meaningfully more expensive than 5% on this signal, which suggests the DRED emission size is signal- dependent rather than loss-dependent for small values. Net result: we get a 6x larger reconstruction window for essentially free. Tests (+3 DRED recovery, +1 #[ignore]'d probe): - opus_single_packet_loss_is_recovered_via_dred — full encode → ingest → decode_next loop with one packet dropped mid-stream. Asserts dred_reconstructions ≥ 1 and observes the exact counter deltas. - opus_lossless_ingest_never_triggers_dred_or_plc — baseline behavior, lossless stream never takes the Missing branch. - codec2_loss_falls_through_to_classical_plc — Codec2 never reconstructs via DRED even if state were populated (which it won't be — Codec2 packets don't carry DRED bytes). - probe_dred_samples_available_by_loss_floor — #[ignore]'d diagnostic that sweeps loss_pct values and prints the resulting DRED window sizes. Kept for future tuning work. New CallDecoder introspection accessors (public but undocumented in the PRD): last_good_dred_seq() and last_good_dred_samples_available() for test diagnostics and future telemetry surfaces in Phase 4. Verification: - cargo check --workspace: zero errors - cargo test -p wzp-codec --lib: 68 passing (Phase 3a baseline held) - cargo test -p wzp-client --lib: 35 passing (+3 Phase 3b tests, +1 ignored diagnostic, no regressions) Next up: Phase 3c mirrors this on the Android engine.rs receive path. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-10 20:03:24 +04:00
Siavash Sameni	54cbebd34e	feat(codec): Phase 1 — enable DRED on all Opus profiles, disable inband FEC Phase 1 of the DRED integration (docs/PRD-dred-integration.md). The Opus encoder now emits DRED (Deep REDundancy) bytes in every packet, carrying a neural-coded history of recent audio that the decoder can use to reconstruct loss bursts up to the configured window. Opus inband FEC (LBRR) is disabled because DRED does the same job better and running both wastes bitrate on overlapping protection. Tiered DRED duration policy per PRD: Studio (Opus 32k/48k/64k): 10 frames = 100 ms Normal (Opus 16k/24k): 20 frames = 200 ms Degraded (Opus 6k): 50 frames = 500 ms Each profile switch (via adaptive quality) updates the DRED duration to match the new tier. A 5% packet_loss floor is applied whenever DRED is active, because libopus 1.5 gates DRED emission on non-zero packet_loss. Real loss measurements from the quality adapter override upward. Escape hatch: AUDIO_USE_LEGACY_FEC=1 reverts the encoder to Phase 0 behavior (inband FEC Mode1, DRED off, no loss floor). Read once at OpusEncoder::new; call-scoped, not re-read mid-call. Trait-level set_inband_fec becomes a no-op in DRED mode to preserve the invariant even if external callers forget. Observations from the bitrate probe test (dred_mode_roundtrip_voice_pattern): DRED mode: 3649 bytes/sec (~29.2 kbps) on Opus 24k + 300 Hz sine Legacy mode: 2383 bytes/sec (~19.1 kbps) Delta: +10.1 kbps The delta is considerably larger than the "+1 kbps flat" figure I carried into the PRD from hazy memory of published DRED benchmarks. Likely because the input (300 Hz sine) is very compressible so the base Opus rate in legacy mode is well below the 24 kbps target, making the delta look disproportionate. Signal-dependent — real speech would probably show a different ratio. If production telemetry shows the overhead is excessive, we can cut DRED duration on the normal tier from 200 ms to 100 ms as a first tuning lever. Not blocking Phase 1 since the test still passes within the reasonable 2000–8000 bytes/sec bounds. Test changes (+8 tests, total wzp-codec: 61 passing): - dred_duration_for_studio_tiers_is_100ms (per-profile policy) - dred_duration_for_normal_tiers_is_200ms - dred_duration_for_degraded_tier_is_500ms - dred_duration_for_codec2_is_zero - default_mode_is_dred_not_legacy (sanity check on fresh construction) - dred_mode_roundtrip_voice_pattern (observes DRED bitrate, asserts bounds) - profile_switch_refreshes_dred_duration (verifies set_profile updates DRED) - set_inband_fec_noop_in_dred_mode (trait-level inband FEC no-op) Verification: - cargo check --workspace: zero errors, no new warnings - cargo test -p wzp-codec: 61/61 passing (53 pre-Phase-1 baseline + 8 new) - Empirical DRED bitrate observed via `rtk proxy cargo test dred_mode_roundtrip_voice_pattern -- --nocapture` Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-10 20:02:35 +04:00
Siavash Sameni	86526a7ad4	feat(codec): Phase 0 — swap audiopus → opusic-c + opusic-sys (libopus 1.5.2) Phase 0 of the DRED integration (docs/PRD-dred-integration.md). No behavior change: inband FEC stays ON, no DRED, same bitrate, same quality. This commit unblocks Phase 1+ by getting us onto libopus 1.5.2 where DRED lives. Rationale for going straight to a custom DecoderHandle: opusic-c::Decoder's inner mut OpusDecoder pointer is pub(crate), so we cannot reach it for the Phase 3 DRED reconstruction path. Running two parallel decoders (one for audio, one for DRED) would drift because the DRED decoder wouldn't see normal decode calls. Single unified DecoderHandle over raw opusic-sys is the only correct architecture, so we build it in Phase 0 rather than rewriting opus_dec.rs twice. Changes: - Cargo.toml (workspace + wzp-codec): remove audiopus 0.3.0-rc.0, add opusic-c 1.5.5 (bundled + dred features), opusic-sys 0.6.0 (bundled), bytemuck 1. Pinned exactly for reproducible libopus 1.5.2. - opus_enc.rs: rewritten against opusic_c::Encoder. Argument order for Encoder::new swapped (Channels first). set_inband_fec(bool) now maps to InbandFec::Mode1 (the libopus 1.5 equivalent of 1.3's LBRR). encode uses bytemuck::cast_slice<i16,u16> at the &[u16] boundary. - dred_ffi.rs (new): DecoderHandle wrapping mut OpusDecoder directly via opusic-sys. Owns the allocation, frees on Drop. Exposes decode, decode_lost, and a pub(crate) as_raw_ptr() for the future Phase 3 DRED reconstruction. Send+Sync justified via &mut self access discipline. - opus_dec.rs: rewritten as a thin AudioDecoder impl over DecoderHandle. Behavior identical to pre-swap. Verification (Phase 0 acceptance gates): - cargo check --workspace: clean (30 pre-existing warnings in jni_bridge.rs unrelated to this work; zero in changed files). - cargo test -p wzp-codec: 53 tests pass (50 pre-swap + 6 new: 3 in dred_ffi.rs for DecoderHandle lifecycle, 3 in opus_enc.rs for version check and roundtrip). - linked_libopus_is_1_5 test asserts opusic_c::version() contains "1.5" — hard signal that the swap landed correctly. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-10 20:02:15 +04:00
Siavash Sameni	a8c2011445	feat: add Opus 32k/48k/64k studio quality tiers Some checks failed Mirror to GitHub / mirror (push) Failing after 36s Details Build Release Binaries / build-amd64 (push) Has been cancelled Details Adds three new codec IDs (Opus32k=6, Opus48k=7, Opus64k=8) and corresponding STUDIO_32K, STUDIO_48K, STUDIO_64K quality profiles. All use 20ms frames with minimal FEC (10%) for maximum quality on good networks. Updated across: wire protocol (codec_id.rs), encoder/decoder (opus_enc/dec.rs), adaptive codec switch (call.rs), CLI (--profile studio-64k), desktop engine + UI slider (8 quality levels from Studio 64k green to Codec2 1.2k red). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 18:31:05 +04:00
Claude	26e9c55f1f	feat: Android VoIP client — Phase 1 (audio quality, network adaptation, crate skeleton) - New wzp-android crate with Oboe C++ backend, lock-free SPSC ring buffers, engine orchestrator, codec pipeline, and Android Gradle project structure - AEC (NLMS adaptive filter), AGC (two-stage with fast attack/slow release), windowed-sinc FIR resampler replacing linear interpolation (wzp-codec) - Opus encoder tuning: complexity 7 default, set_expected_loss support - Mobile jitter buffer: asymmetric EMA (fast up/slow down), handoff spike detection with 2s cooldown, configurable safety margin - Network-aware quality control: cellular-specific thresholds, faster downgrade on cellular, proactive tier drop on WiFi→cellular handoff, FEC ratio boost during network transitions - Handoff detection in PathMonitor via RTT jitter spike analysis Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-04 18:07:55 +00:00
Siavash Sameni	51e893590c	feat: WarzonePhone lossy VoIP protocol — Phase 1 complete Rust workspace with 7 crates implementing a custom VoIP protocol designed for extremely lossy connections (5-70% loss, 100-500kbps, 300-800ms RTT). 89 tests passing across all crates. Crates: - wzp-proto: Wire format, traits, adaptive quality controller, jitter buffer, session FSM - wzp-codec: Opus encoder/decoder (audiopus), Codec2 stubs, adaptive switching, resampling - wzp-fec: RaptorQ fountain codes, interleaving, block management (proven 30-70% loss recovery) - wzp-crypto: X25519+ChaCha20-Poly1305, Warzone identity compatible, anti-replay, rekeying - wzp-transport: QUIC via quinn with DATAGRAM frames, path monitoring, signaling streams - wzp-relay: Integration stub (Phase 2) - wzp-client: Integration stub (Phase 2) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-27 12:45:07 +04:00

6 Commits