wz-phone

Author	SHA1	Message	Date
Siavash Sameni	c95255d31b	fix(build): build-and-notify.sh — parameterize branch, fail loud on pull errors Two bugs caused the post-Phase-3c APK build to ship from the wrong source: 1. The remote script hardcoded `feat/android-voip-client` as the branch to pull — not the current working branch. It was never updated when we moved to android-rewrite and then opus-DRED. 2. `git reset --hard origin/feat/android-voip-client 2>/dev/null \|\| true` silently swallowed the reset failure when that branch didn't exist on the remote's origin, leaving the tree on whatever branch was there from a previous session. In our case that was feat/desktop-audio-rewrite at `d0c1731` — the android-rewrite baseline, missing every Phase 0-4 commit. The ntfy notification even included the stale commit hash `d0c1731` but nobody noticed because the hash wasn't being cross-checked against the branch the script claimed to be building. Fix: Local side (scripts/build-and-notify.sh): - Auto-detect the current local branch via `git branch --show-current` - Accept `--branch NAME` override for explicit control - Pass the branch as a third positional arg to the remote build script - Abort early if we can't determine a branch (detached HEAD) - Updated usage docs to reflect the "build whatever I'm working on" default Remote side (embedded heredoc): - Read BRANCH from $3 and abort if empty - `git fetch origin "$BRANCH"` — no piping to /dev/null, errors surface - `git reset --hard "origin/$BRANCH"` — no `\|\| true`, failures abort - Print the resolved commit hash + subject line immediately after reset so logs cross-reference the source clearly - Started/done ntfy notifications now include the branch name alongside the commit hash: "WZP Android [opus-DRED @ `953ab71`] done! APK: ..." Result: next build will (a) actually fetch the requested branch from the remote's gitea origin, (b) fail loudly if the branch doesn't exist or the reset fails, and (c) surface the branch in the ntfy notifications so future "wait, which build is this?" confusion is impossible. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-10 19:27:18 +04:00
Siavash Sameni	99c0173590	feat(telemetry): Phase 4 — LossRecoveryUpdate protocol + relay metrics + DebugReporter Phase 4 lays the telemetry foundation for distinguishing DRED recoveries from classical PLC in production: a new SignalMessage variant, two new per-session Prometheus counters on the relay side, and a highlighted loss-recovery section in the Android DebugReporter. The periodic emitter (client → relay) and Grafana panel are deferred to Phase 4b — this commit ships the protocol surface, the relay sink, and the immediate user-visible debug output. Once 4b lands the full path (emitter → relay → Prometheus → Grafana), the metrics here will automatically start receiving data. Scope decision — why not extend QualityReport instead: The existing wire-format QualityReport is a fixed 4-byte media packet trailer. Adding counter fields to it would shift the binary layout and break backward compatibility (old receivers would parse the last 4 bytes of the extended trailer as QR, corrupting audio). Using a new SignalMessage variant on the reliable QUIC signal stream sidesteps the wire-format problem entirely — serde JSON enums tolerate unknown variants gracefully on old receivers, and the signal channel is the right layer for periodic telemetry aggregates. Changes: wzp-proto/src/packet.rs: - New SignalMessage::LossRecoveryUpdate variant carrying: * dred_reconstructions: u64 (monotonic since call start) * classical_plc_invocations: u64 (monotonic) * frames_decoded: u64 (for rate calculation) - All three fields tagged #[serde(default)] for forward compat. wzp-client/src/featherchat.rs: - Added a match arm so signal_to_call_type() handles the new variant (treat as Offer for featherChat bridging purposes). wzp-relay/src/metrics.rs: - Two new IntCounterVec metrics on the relay, labeled by session_id: * wzp_relay_session_dred_reconstructions_total * wzp_relay_session_classical_plc_total - New method update_session_loss_recovery(session_id, dred, plc) applies monotonic deltas: if the incoming totals exceed the current counter, the difference is inc_by'd. If the incoming totals are LOWER (client restart or counter reset), the Prometheus counter holds steady until the client catches up. This matches the existing update_session_buffer delta pattern. - remove_session_metrics() now cleans up the two new labels. - New test session_loss_recovery_monotonic_delta exercises: * initial population (10 DRED, 2 PLC) * forward advance (25, 5 → delta +15, +3) * lower values ignored (client reset → counters unchanged) * client catches up (30, 8 → advances to new max) - Existing session_metrics_cleanup test extended to cover the new counters. android/app/src/main/java/com/wzp/debug/DebugReporter.kt: - Phase 4 users — and incident responders — need to quickly see whether DRED is actually firing during a call. The stats JSON already carries the counters (after Phase 3c), but they were buried in the trailing JSON dump. Added a dedicated "=== Loss Recovery ===" section to the meta preamble that extracts dred_reconstructions, classical_plc_invocations, frames_decoded, and fec_recovered from the JSON and displays them plainly, plus computed percentages when frames_decoded > 0. - New extractLongField helper: tiny hand-rolled JSON integer extractor. We don't want to pull in a full JSON parser for this single use case and CallStats has a flat, well-known schema. Verification: - cargo check --workspace: zero errors - cargo test -p wzp-proto --lib: 63 passing - cargo test -p wzp-codec --lib: 68 passing - cargo test -p wzp-client --lib: 35 passing (+1 ignored probe) - cargo test -p wzp-relay --lib: 68 passing (+1 new Phase 4 test) - cargo check -p wzp-android --lib: zero errors - Android APK build verified earlier today (unridden-alfonso.apk via the remote Docker builder) — Phase 0–3c confirmed to compile end-to-end on the NDK target. Phase 4b remaining (not blocking this commit): - Periodic LossRecoveryUpdate emitter in wzp-client/src/call.rs and wzp-android/src/engine.rs (every ~5 s) - Relay-side handler in main.rs that matches the new variant and calls metrics.update_session_loss_recovery - Grafana "Loss recovery breakdown" panel in docs/grafana-dashboard.json Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-10 19:21:04 +04:00
Siavash Sameni	953ab71392	feat(codec): Phase 3c — Android engine.rs DRED reconstruction on packet loss Phase 3c mirrors Phase 3b on the Android receive path. With Phase 0-3b landed on desktop + Android encoder, this commit completes codec-layer loss recovery on the Android decoder side. Architectural difference vs desktop: engine.rs has NO jitter buffer. The recv task reads packets directly from the transport via recv_media().await and writes decoded audio straight into the playout ring. There is no PlayoutResult::Missing equivalent. Gap detection therefore has to be done via sequence-number tracking — when a packet arrives with seq > expected_seq, the frames in between are missing and we attempt to reconstruct them via DRED before decoding the newly- arrived packet. Implementation: Imports & types: - Added wzp_codec::AdaptiveDecoder, wzp_codec::dred_ffi::{ DredDecoderHandle, DredState} imports. - Changed the `decoder` local from Box<dyn AudioDecoder> (via wzp_codec::create_decoder) to concrete AdaptiveDecoder::new(profile). Same reasoning as Phase 3b: reconstruct_from_dred is an inherent method, not a trait method, so we need the concrete type. Recv task state (all task-local, no new struct fields): - dred_decoder: DredDecoderHandle - dred_parse_scratch: DredState (reused, overwritten per parse) - last_good_dred: DredState (cached most-recent valid state) - last_good_dred_seq: Option<u16> - expected_seq: Option<u16> (for gap detection) - dred_reconstructions: u64 (telemetry) - classical_plc_invocations: u64 (telemetry) Recv loop body (Opus source packets only): 1. Parse DRED from the new packet first so last_good_dred reflects the freshest state available for gap recovery. 2. Detect a gap: gap = pkt.seq.wrapping_sub(expected_seq). Cap at MAX_GAP_FRAMES = 16 (320 ms) to avoid huge wraparound scenarios. 3. For each missing seq in the gap: offset = (last_good_dred_seq - missing_seq) * frame_samples if 0 < offset <= last_good_dred.samples_available(): reconstruct_from_dred + write to playout ring bump dred_reconstructions else: decoder.decode_lost (classical PLC) + write + bump plc counter 4. Decode the current packet normally and write to playout ring (unchanged from Phase 2). 5. Update expected_seq = pkt.seq.wrapping_add(1). Profile-switch handling: when the incoming codec changes (triggering decoder.set_profile), reset last_good_dred_seq and expected_seq to None. The cached DRED state is tied to the old profile's frame rate and would produce wrong offsets after the switch; starting fresh is correct. Decode-error fallback: the existing `Err(e) => decode_lost` branch now also increments classical_plc_invocations so the counter accurately reflects all PLC invocations (gap-detected AND decode- error-triggered). Telemetry (CallStats additions): - stats.dred_reconstructions: u64 - stats.classical_plc_invocations: u64 Both updated on every packet arrival in the existing stats.lock() block alongside frames_decoded/fec_recovered, so the Android UI and JNI bridge already have these values without any further plumbing. The periodic recv stats log now includes both counters. Ordering note: DRED gap reconstruction happens BEFORE decoding the new packet's audio because the playout ring is FIFO. Gap samples must be written before the new packet's samples so temporal order is preserved. Out-of-order late arrivals (seq < expected_seq) are naturally dropped as stale by the gap detection (gap would be a large wraparound value exceeding MAX_GAP_FRAMES). Verification: - cargo check --workspace: zero errors - cargo test -p wzp-codec --lib: 68 passing (unchanged from Phase 3b) - cargo test -p wzp-client --lib: 35 passing (unchanged from Phase 3b) - cargo check -p wzp-android --lib: zero errors - cargo test -p wzp-android cannot run on macOS host (pre-existing -llog linker dep, unrelated). Real end-to-end verification happens via the Android APK build on the remote Docker builder (scripts/build-and-notify.sh). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-10 19:06:45 +04:00
Siavash Sameni	662b14a2af	feat(codec): Phase 3b — CallDecoder DRED reconstruction on packet loss Phase 3b of the DRED integration — wires the Phase 3a FFI primitives into the desktop receive path. When the jitter buffer reports a missing Opus frame, CallDecoder now attempts to reconstruct the audio from the most recently parsed DRED side-channel state before falling through to classical PLC. Architectural refinement vs the PRD's literal wording: the PRD said "jitter buffer takes a Box<dyn DredReconstructor>". After checking deps, wzp-transport depends only on wzp-proto (not wzp-codec). Putting DRED state in the jitter buffer would require a new cross-crate dep and couple the codec-agnostic buffer to libopus. Instead, this commit keeps the DRED state ring and reconstruction dispatch inside CallDecoder (one layer up from the jitter buffer), intercepting the existing PlayoutResult::Missing signal. Same lookahead/backfill semantics, cleaner layering, zero change to wzp-transport. Changes: CallDecoder field type: Box<dyn AudioDecoder> → AdaptiveDecoder. Required because Phase 3b calls the inherent reconstruct_from_dred method, which cannot live on the AudioDecoder trait without dragging libopus DredState through wzp-proto. In practice AdaptiveDecoder was the only AudioDecoder implementor anyway — the trait abstraction was buying nothing. Method call sites unchanged because AdaptiveDecoder also implements AudioDecoder. New CallDecoder fields: - dred_decoder: DredDecoderHandle - dred_parse_scratch: DredState (scratch for parse_into) - last_good_dred: DredState (cached most-recent valid state) - last_good_dred_seq: Option<u16> - dred_reconstructions: u64 (Phase 4 telemetry) - classical_plc_invocations: u64 (Phase 4 telemetry) CallDecoder::ingest — on Opus non-repair packets, parse DRED into the scratch state. On success (samples_available > 0), std::mem::swap the scratch into last_good_dred and record the seq. This is O(1) per packet, zero allocation after construction (the two DredState buffers are allocated once in new() and reused forever). CallDecoder::decode_next — on PlayoutResult::Missing(seq) for Opus profiles: if last_good_dred_seq > seq and the seq delta × frame_samples fits within samples_available, call audio_dec.reconstruct_from_dred and bump dred_reconstructions. Otherwise fall through to classical PLC and bump classical_plc_invocations. The Codec2 path always falls through to classical PLC since DRED is libopus-only and AdaptiveDecoder::reconstruct_from_dred rejects Codec2 tiers explicitly. OpusDecoder and AdaptiveDecoder: new inherent reconstruct_from_dred method that delegates to the underlying DecoderHandle. Needed to bridge CallDecoder's wzp-client code to the Phase 3a FFI wrappers without touching the AudioDecoder trait. CRITICAL FINDING — raised DRED loss floor from 5% to 15%: Phase 3b testing discovered that libopus 1.5's DRED emission window scales aggressively with OPUS_SET_PACKET_LOSS_PERC. Empirical data (see probe_dred_samples_available_by_loss_floor, an #[ignore]'d diagnostic test in call.rs): loss_pct samples_available effective_ms 5% 720 15 ms (useless!) 10% 2640 55 ms 15% 4560 95 ms 20% 6480 135 ms 25%+ 8400 (capped) 175 ms (~87% of 200 ms configured) The Phase 1 default of 5% produced only a 15 ms reconstruction window — too small to even cover a single 20 ms Opus frame. DRED was effectively disabled even though it was emitting bytes. Raised the floor to 15% (95 ms window) as the minimum that actually provides single-frame loss recovery. This updates Phase 1's DRED_LOSS_FLOOR_PCT constant in opus_enc.rs and the accompanying module docstring. Trade-off: 15% assumed loss slightly increases encoder bitrate overhead on clean networks. Measured via the existing phase1 bitrate probe: Before (5% floor): 3649 bytes/sec at Opus 24k + 300 Hz sine After (15% floor): 3568 bytes/sec at Opus 24k + 300 Hz sine The delta is within noise — 15% isn't meaningfully more expensive than 5% on this signal, which suggests the DRED emission size is signal- dependent rather than loss-dependent for small values. Net result: we get a 6x larger reconstruction window for essentially free. Tests (+3 DRED recovery, +1 #[ignore]'d probe): - opus_single_packet_loss_is_recovered_via_dred — full encode → ingest → decode_next loop with one packet dropped mid-stream. Asserts dred_reconstructions ≥ 1 and observes the exact counter deltas. - opus_lossless_ingest_never_triggers_dred_or_plc — baseline behavior, lossless stream never takes the Missing branch. - codec2_loss_falls_through_to_classical_plc — Codec2 never reconstructs via DRED even if state were populated (which it won't be — Codec2 packets don't carry DRED bytes). - probe_dred_samples_available_by_loss_floor — #[ignore]'d diagnostic that sweeps loss_pct values and prints the resulting DRED window sizes. Kept for future tuning work. New CallDecoder introspection accessors (public but undocumented in the PRD): last_good_dred_seq() and last_good_dred_samples_available() for test diagnostics and future telemetry surfaces in Phase 4. Verification: - cargo check --workspace: zero errors - cargo test -p wzp-codec --lib: 68 passing (Phase 3a baseline held) - cargo test -p wzp-client --lib: 35 passing (+3 Phase 3b tests, +1 ignored diagnostic, no regressions) Next up: Phase 3c mirrors this on the Android engine.rs receive path. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-10 18:55:25 +04:00
Siavash Sameni	b830f29e66	feat(codec): Phase 3a — DRED FFI primitives (DredDecoderHandle + DredState) Phase 3a of the DRED integration — the foundation for codec-layer loss recovery. Adds three new safe wrappers to crates/wzp-codec/src/dred_ffi.rs over the raw opusic-sys FFI, plus the reconstruction method on the existing DecoderHandle. No call-site integration yet — that lands in Phase 3b (desktop) and Phase 3c (Android). New types: - `DredDecoderHandle`: owns mut OpusDREDDecoder from opus_dred_decoder_create. Used for parsing DRED side-channel data out of arriving Opus packets. This is a SEPARATE libopus object from OpusDecoder — it has its own internal state. Freed via opus_dred_decoder_destroy on Drop. - `DredState`: owns mut OpusDRED from opus_dred_alloc (a fixed ~10.6 KB buffer per libopus 1.5). Holds parsed DRED data between the parse and reconstruct steps. Reusable — parse_into overwrites contents. Tracks samples_available as a cached u32 so callers don't thread the value separately. Freed via opus_dred_free on Drop. New methods: - `DredDecoderHandle::parse_into(&mut self, state: &mut DredState, packet)` wraps opus_dred_parse with max_dred_samples=48000 (1s max), sampling_rate =48000, defer_processing=0. Returns the positive sample offset of the first decodable DRED sample, 0 if no DRED is present, or an error. Populates state.samples_available so subsequent reconstruct calls know the valid offset range. - `DecoderHandle::reconstruct_from_dred(&mut self, state, offset_samples, output)` wraps opus_decoder_dred_decode. Reconstructs audio at a specific sample position (positive, measured backward from the DRED anchor packet) into a caller-provided output buffer. Validates that 0 < offset_samples <= state.samples_available() before calling the FFI to catch range bugs. Tests (+7, wzp-codec total: 68 passing): - dred_decoder_handle_creates_and_drops - dred_state_creates_and_drops - dred_state_reset_zeroes_counter - dred_parse_and_reconstruct_roundtrip — end-to-end validation. Encodes 60 frames of a 300 Hz sine wave through a DRED-enabled Opus 24k encoder, parses DRED state out of each arriving packet, asserts that at least one packet carries non-zero samples_available (DRED warm-up completes within the first second), then reconstructs 20 ms of audio from inside the window and asserts non-zero total energy. This is the hard signal that the full libopus 1.5 DRED FFI chain is correctly wired on our side. - reconstruct_with_out_of_range_offset_errors — offset > samples_available is rejected at the Rust layer before the FFI call. - reconstruct_with_zero_offset_errors — offset <= 0 rejected. - dred_parse_empty_packet_returns_zero — graceful handling of empty input. Architectural note (divergence from PRD's literal wording): The PRD said "jitter buffer takes a Box<dyn DredReconstructor>". After checking Cargo.toml for wzp-transport, it does NOT depend on wzp-codec — only wzp-proto. Adding a DRED state ring inside the jitter buffer would require a new cross-crate dependency and couple the codec-agnostic jitter buffer to libopus internals. Instead, Phase 3b will put the DRED state ring and reconstruction dispatch in CallDecoder (one layer up from the jitter buffer), intercepting the existing PlayoutResult::Missing signal and attempting reconstruction before falling through to classical PLC. The jitter buffer itself stays unchanged. Same lookahead/backfill semantics, cleaner layering. PRD's intent preserved, implementation refined. Verification: - cargo check --workspace: zero errors - cargo test -p wzp-codec --lib: 68 passing (61 Phase 2 baseline + 7 new) - The roundtrip test is the acceptance gate — it proves that opus_dred_decoder_create, opus_dred_alloc, opus_dred_parse, and opus_decoder_dred_decode all work correctly through our wrappers on real libopus 1.5.2 output. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-10 17:51:15 +04:00
Siavash Sameni	d5c298d0b5	feat(codec): Phase 2 — remove RaptorQ from Opus tiers, Codec2 unchanged Phase 2 of the DRED integration (docs/PRD-dred-integration.md). With Phase 1 having enabled DRED on every Opus profile, the app-level RaptorQ layer is now redundant overhead on those tiers: +20% bitrate, +40–100 ms receive-side latency (block wait), +CPU for stats we never used. This phase removes RaptorQ from the Opus encode and decode paths on both the desktop (wzp-client/call.rs) and Android (wzp-android/engine.rs) sides. Codec2 tiers keep RaptorQ with their current ratios unchanged — DRED is libopus-only and Codec2 has no neural equivalent. Encoder changes (the real bandwidth / CPU win): - CallEncoder::encode_frame and engine.rs encode loop now gate the RaptorQ path on !codec.is_opus(): - Opus source packets emit fec_block=0, fec_symbol=0, fec_ratio_encoded=0 in the MediaHeader - fec_enc.add_source_symbol is skipped on Opus - generate_repair + repair packet emission is skipped on Opus - block_id and frame_in_block counters stay frozen at 0 for Opus - Codec2 path is byte-for-byte identical to pre-Phase-2 behavior. Decoder changes (mostly cleanup, since both live decoder paths were already reading audio directly from source packets and only using the RaptorQ decoder output for stats): - CallDecoder::ingest skips fec_dec.add_symbol on Opus packets. Source packets still flow to the jitter buffer; Opus repair packets from old senders are dropped cleanly (repair packets never hit the jitter buffer either). - engine.rs recv loop skips fec_dec.add_symbol, fec_dec.try_decode, and fec_dec.expire_before on Opus packets. The `fec_recovered` stat counter becomes Codec2-only (a separate DRED reconstruction counter lands in Phase 4). Wire-format backward compat verified at pre-flight: - Old receiver + new sender: engine.rs pipeline.rs path gates on non-zero fec_block/fec_symbol which now never fire for Opus, so the RaptorQ decoder simply isn't fed. Audio flows normally. Desktop CallDecoder's old path accumulated packets into the stale-eviction HashMap, which cleans up after 2s — harmless. - New receiver + old sender: new receiver skips RaptorQ on Opus so old-sender repair packets are ignored entirely (no crash, no double- decode). Loses the (previously vestigial) RaptorQ recovery benefit, which was never actually active in the audio path. Source packets still decode normally. - No wire format version bump required. MediaHeader is unchanged; we just zero the FEC fields on Opus packets. Test changes: - Removed `encoder_generates_repair_on_full_block` — asserted the old (pre-Phase-2) RaptorQ-on-Opus behavior and is now incorrect. Replaced with two symmetric tests: - `opus_source_packets_have_zero_fec_header_fields` — verifies Phase 2 invariants on Opus packets - `opus_encoder_never_emits_repair_packets` — runs 20 frames of non-silent sine wave through a GOOD-profile encoder, asserts exactly 20 output packets, zero repair - `codec2_encoder_generates_repair_on_full_block` — same shape as the old test but on CATASTROPHIC profile (Codec2 1200, 8 frames/block, ratio 1.0) to verify Codec2 path still emits repairs as before Verification: - cargo check --workspace: zero errors - cargo test -p wzp-codec --lib: 61 passing (Phase 1 baseline held) - cargo test -p wzp-client --lib: 32 passing (+3 new Phase 2 tests, -1 old test removed) - cargo check -p wzp-android --lib: zero errors (host link of wzp-android tests fails on -llog per pre-existing Android-only build.rs, unrelated to this work; integration build via build-and-notify.sh will validate Android end-to-end) - Pre-existing broken integration test in crates/wzp-client/tests/handshake_integration.rs (SignalMessage schema drift) is NOT caused by this commit — baseline had the same 3 compile errors before Phase 2. Flagged as a separate cleanup task. Expected observable effects on a real call: - Opus 24k outgoing bitrate drops from ~28.8 kbps (ratio 0.2 RaptorQ) to ~25 kbps (base 24 kbps + DRED ~1–10 kbps signal-dependent) - Opus receive-side latency drops ~40 ms on clean network (no more block wait — jitter buffer emits as soon as a source packet arrives) - Codec2 calls show no latency or bitrate change Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-10 17:42:33 +04:00
Siavash Sameni	4090206909	feat(codec): Phase 1 — enable DRED on all Opus profiles, disable inband FEC Phase 1 of the DRED integration (docs/PRD-dred-integration.md). The Opus encoder now emits DRED (Deep REDundancy) bytes in every packet, carrying a neural-coded history of recent audio that the decoder can use to reconstruct loss bursts up to the configured window. Opus inband FEC (LBRR) is disabled because DRED does the same job better and running both wastes bitrate on overlapping protection. Tiered DRED duration policy per PRD: Studio (Opus 32k/48k/64k): 10 frames = 100 ms Normal (Opus 16k/24k): 20 frames = 200 ms Degraded (Opus 6k): 50 frames = 500 ms Each profile switch (via adaptive quality) updates the DRED duration to match the new tier. A 5% packet_loss floor is applied whenever DRED is active, because libopus 1.5 gates DRED emission on non-zero packet_loss. Real loss measurements from the quality adapter override upward. Escape hatch: AUDIO_USE_LEGACY_FEC=1 reverts the encoder to Phase 0 behavior (inband FEC Mode1, DRED off, no loss floor). Read once at OpusEncoder::new; call-scoped, not re-read mid-call. Trait-level set_inband_fec becomes a no-op in DRED mode to preserve the invariant even if external callers forget. Observations from the bitrate probe test (dred_mode_roundtrip_voice_pattern): DRED mode: 3649 bytes/sec (~29.2 kbps) on Opus 24k + 300 Hz sine Legacy mode: 2383 bytes/sec (~19.1 kbps) Delta: +10.1 kbps The delta is considerably larger than the "+1 kbps flat" figure I carried into the PRD from hazy memory of published DRED benchmarks. Likely because the input (300 Hz sine) is very compressible so the base Opus rate in legacy mode is well below the 24 kbps target, making the delta look disproportionate. Signal-dependent — real speech would probably show a different ratio. If production telemetry shows the overhead is excessive, we can cut DRED duration on the normal tier from 200 ms to 100 ms as a first tuning lever. Not blocking Phase 1 since the test still passes within the reasonable 2000–8000 bytes/sec bounds. Test changes (+8 tests, total wzp-codec: 61 passing): - dred_duration_for_studio_tiers_is_100ms (per-profile policy) - dred_duration_for_normal_tiers_is_200ms - dred_duration_for_degraded_tier_is_500ms - dred_duration_for_codec2_is_zero - default_mode_is_dred_not_legacy (sanity check on fresh construction) - dred_mode_roundtrip_voice_pattern (observes DRED bitrate, asserts bounds) - profile_switch_refreshes_dred_duration (verifies set_profile updates DRED) - set_inband_fec_noop_in_dred_mode (trait-level inband FEC no-op) Verification: - cargo check --workspace: zero errors, no new warnings - cargo test -p wzp-codec: 61/61 passing (53 pre-Phase-1 baseline + 8 new) - Empirical DRED bitrate observed via `rtk proxy cargo test dred_mode_roundtrip_voice_pattern -- --nocapture` Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-10 17:26:34 +04:00
Siavash Sameni	086a74782f	feat(codec): Phase 0 — swap audiopus → opusic-c + opusic-sys (libopus 1.5.2) Phase 0 of the DRED integration (docs/PRD-dred-integration.md). No behavior change: inband FEC stays ON, no DRED, same bitrate, same quality. This commit unblocks Phase 1+ by getting us onto libopus 1.5.2 where DRED lives. Rationale for going straight to a custom DecoderHandle: opusic-c::Decoder's inner mut OpusDecoder pointer is pub(crate), so we cannot reach it for the Phase 3 DRED reconstruction path. Running two parallel decoders (one for audio, one for DRED) would drift because the DRED decoder wouldn't see normal decode calls. Single unified DecoderHandle over raw opusic-sys is the only correct architecture, so we build it in Phase 0 rather than rewriting opus_dec.rs twice. Changes: - Cargo.toml (workspace + wzp-codec): remove audiopus 0.3.0-rc.0, add opusic-c 1.5.5 (bundled + dred features), opusic-sys 0.6.0 (bundled), bytemuck 1. Pinned exactly for reproducible libopus 1.5.2. - opus_enc.rs: rewritten against opusic_c::Encoder. Argument order for Encoder::new swapped (Channels first). set_inband_fec(bool) now maps to InbandFec::Mode1 (the libopus 1.5 equivalent of 1.3's LBRR). encode uses bytemuck::cast_slice<i16,u16> at the &[u16] boundary. - dred_ffi.rs (new): DecoderHandle wrapping mut OpusDecoder directly via opusic-sys. Owns the allocation, frees on Drop. Exposes decode, decode_lost, and a pub(crate) as_raw_ptr() for the future Phase 3 DRED reconstruction. Send+Sync justified via &mut self access discipline. - opus_dec.rs: rewritten as a thin AudioDecoder impl over DecoderHandle. Behavior identical to pre-swap. Verification (Phase 0 acceptance gates): - cargo check --workspace: clean (30 pre-existing warnings in jni_bridge.rs unrelated to this work; zero in changed files). - cargo test -p wzp-codec: 53 tests pass (50 pre-swap + 6 new: 3 in dred_ffi.rs for DecoderHandle lifecycle, 3 in opus_enc.rs for version check and roundtrip). - linked_libopus_is_1_5 test asserts opusic_c::version() contains "1.5" — hard signal that the swap landed correctly. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-10 17:15:55 +04:00
Siavash Sameni	09259cd6b8	docs: add PRD for DRED integration and Opus-tier FEC simplification Plans the libopus 1.5.2 upgrade (audiopus → opusic-c/opusic-sys), DRED enablement with tiered durations (100/200/500ms studio/normal/degraded), removal of RaptorQ and Opus inband FEC from the Opus tiers, jitter buffer lookahead/backfill refactor, and runtime escape hatch for rollout safety. RaptorQ + current ratios preserved on Codec2 tiers (no DRED there). Includes pre-flight verification findings: opusic-c Decoder inner pointer is inaccessible (requires unified opusic-sys DecoderHandle), libopus 1.5 DRED API semantics clarified against xiph/opus opus.h, wire-format backward compat verified on both live receive paths. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-10 17:04:11 +04:00
Siavash Sameni	75bc72a884	docs: add BRANCH-android-rewrite.md and update ARCH/ADMIN/USER_GUIDE Documents the android-rewrite branch story end-to-end: - Why the Kotlin+JNI stack was abandoned (stack overflow, libcrypto TLS race, __init_tcb TCB leak, ring runtime reuse crash) - The Tauri 2.x Mobile pivot that reuses the desktop codebase verbatim - Android-specific pieces: wzp-native standalone cdylib loaded via libloading, android_audio.rs JVM routing, Oboe audio config quirks - Build pipeline via build-tauri-android.sh + wzp-android-builder image - Known quirks (API 34/36 coexistence, NDK path absolutes, etc.) Also appends shared-doc sections (identical on both branches): - ARCHITECTURE.md: "Audio Backend Architecture (Platform Matrix)" covering CPAL / VPIO / WASAPI / Oboe backends, selection matrix, the wzp-native cdylib rationale, and the vendored audiopus_sys fix. - ADMINISTRATION.md: "Build Pipelines" with Docker images (wzp-android-builder, wzp-windows-builder), per-pipeline usage (Android APK, Linux x86_64, Windows .exe), the Hetzner Cloud alternative, ntfy/rustypaste integration, and credential locations. - USER_GUIDE.md: "Direct 1:1 Calling (Desktop + Android)" covering history + recent contacts + deregister UI, and "Windows AEC Variants" explaining the AEC vs noAEC builds and driver caveats. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-10 15:20:12 +04:00
Siavash Sameni	6aa52accef	feat(android): Tauri 2.x mobile build infrastructure Adds infrastructure for building the Tauri 2.x Android app (the pivot away from the Kotlin+JNI approach whose stack overflow / libcrypto TLS crash / thread lifecycle hell is documented in the incident report): - scripts/Dockerfile.android-builder: extended to support both the legacy Kotlin+JNI pipeline (cargo-ndk + Gradle) and the new Tauri mobile pipeline (tauri-cli + Node/npm). Adds Node.js 20 LTS, API level 36 + build-tools 35.0.0, and additional apt packages. - scripts/build-tauri-android.sh: fire-and-forget remote build via Docker on SepehrHomeserverdk, with ntfy.sh notifications and rustypaste upload of the resulting APK. Mirrors the pattern of build-tauri-android-docker.sh but targets the new Tauri pipeline. - docs/incident-tauri-android-init-tcb.md: postmortem of the Kotlin+JNI crash cascade that drove the Tauri mobile rewrite decision. Covers the __init_tcb / pthread_create bionic private symbol leak, the staticlib + cdylib crate-type interaction, the Dispatchers.IO 512 KB thread stack overflow, and the tokio runtime / libcrypto TLS race. - scripts/mint-tmux.sh, scripts/prep-linux-mint.sh: general dev infrastructure (tmux + Linux Mint workstation prep scripts). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-10 15:06:46 +04:00
Siavash Sameni	d0c17317ea	fix: generate seed if empty on register (fresh install), add JNI debug logging Some checks failed Mirror to GitHub / mirror (push) Failing after 41s Details Build Release Binaries / build-amd64 (push) Failing after 3m38s Details	2026-04-09 10:21:59 +04:00
Siavash Sameni	5799d18aee	debug: add tracing to nativeSignalConnect entry Some checks failed Mirror to GitHub / mirror (push) Failing after 36s Details Build Release Binaries / build-amd64 (push) Failing after 3m46s Details	2026-04-09 10:17:13 +04:00
Siavash Sameni	46c9ee1be3	fix: single thread for entire signal lifecycle — runtime never dropped (libcrypto TLS fix) Some checks failed Mirror to GitHub / mirror (push) Failing after 37s Details Build Release Binaries / build-amd64 (push) Failing after 3m52s Details	2026-04-09 10:11:33 +04:00
Siavash Sameni	b53eae9192	fix: split start() into connect+register (inline) + run() (separate thread) — avoids thread::spawn closure stack overflow Some checks failed Mirror to GitHub / mirror (push) Failing after 35s Details Build Release Binaries / build-amd64 (push) Failing after 3m26s Details	2026-04-09 10:02:07 +04:00
Siavash Sameni	a3f54566d4	fix: call nativeSignalConnect from 8MB Java Thread, not Dispatchers.IO Some checks failed Mirror to GitHub / mirror (push) Failing after 39s Details Build Release Binaries / build-amd64 (push) Failing after 3m54s Details	2026-04-09 09:50:30 +04:00
Siavash Sameni	76e9fe5e43	fix: single thread+runtime for signal lifecycle — avoids ring/libcrypto TLS conflict on pthread_exit Some checks failed Mirror to GitHub / mirror (push) Failing after 38s Details Build Release Binaries / build-amd64 (push) Failing after 3m46s Details	2026-04-09 09:44:46 +04:00
Siavash Sameni	b0a89d4f39	docs: PRD for desktop direct calling backport + UI fixes Some checks failed Mirror to GitHub / mirror (push) Failing after 36s Details Build Release Binaries / build-amd64 (push) Failing after 3m39s Details	2026-04-09 09:39:50 +04:00
Siavash Sameni	abc96e8887	refactor: separate SignalManager from WzpEngine for direct calling Some checks failed Mirror to GitHub / mirror (push) Failing after 40s Details Build Release Binaries / build-amd64 (push) Failing after 3m40s Details SignalManager (NEW): - Dedicated Rust struct with its own QUIC connection to _signal - Separate JNI handle (nativeSignalConnect/GetState/PlaceCall/etc) - Kotlin wrapper polls state every 500ms via getState() JSON - Lives independently of WzpEngine — survives across calls - connect() blocks briefly on 8MB thread, then recv loop runs on dedicated thread WzpEngine (CLEANED): - Back to pure media-only role (audio, codec, FEC, jitter) - Removed start_signaling/place_call/answer_call methods - Removed signal_transport/signal_fingerprint from EngineState CallViewModel: - Two separate managers: signalManager (persistent) + engine (per-call) - Two separate polling loops: signalPollJob + statsJob - Auto-connect to media room when signal polling detects "setup" state - hangupDirectCall() ends media but keeps signal alive Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-09 09:34:36 +04:00
Siavash Sameni	3a6ae61f8d	fix: show real identity fingerprint (SHA-256 full format) on Android home screen Some checks failed Mirror to GitHub / mirror (push) Failing after 39s Details Build Release Binaries / build-amd64 (push) Failing after 1m30s Details	2026-04-09 09:12:47 +04:00
Siavash Sameni	4c536d256b	fix: install rustls crypto provider once in nativeInit, not per-thread (libcrypto TLS conflict) Some checks failed Mirror to GitHub / mirror (push) Failing after 38s Details Build Release Binaries / build-amd64 (push) Failing after 4m18s Details	2026-04-09 09:07:40 +04:00
Siavash Sameni	b0ec9ff4ab	fix: signal mode UI + place_call via stored signal transport Some checks failed Mirror to GitHub / mirror (push) Failing after 37s Details Build Release Binaries / build-amd64 (push) Failing after 3m49s Details - Don't set callState for signal-only states (prevents auto-join room) - Store signal transport + fingerprint in EngineState after registration - place_call/answer_call send directly via signal transport (not command channel) - Spawn small threads for async signal sends (non-blocking) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-09 08:58:22 +04:00
Siavash Sameni	5855533a39	fix: start stats polling before blocking startSignaling call Some checks failed Mirror to GitHub / mirror (push) Failing after 39s Details Build Release Binaries / build-amd64 (push) Failing after 3m46s Details	2026-04-09 08:38:06 +04:00
Siavash Sameni	ed09c2e8cc	fix: use block_on pattern for signaling (same as start_call) — no thread::spawn Some checks failed Mirror to GitHub / mirror (push) Failing after 37s Details Build Release Binaries / build-amd64 (push) Failing after 3m50s Details	2026-04-09 08:33:08 +04:00
Siavash Sameni	f44306cc17	fix: move ALL signaling code into JNI-spawned 8MB thread — zero Rust on caller stack Some checks failed Mirror to GitHub / mirror (push) Failing after 40s Details Build Release Binaries / build-amd64 (push) Failing after 3m51s Details	2026-04-09 08:19:48 +04:00
Siavash Sameni	0b821585ab	fix: call nativeStartSignaling from Java Thread with 8MB stack, not Kotlin IO dispatcher Some checks failed Mirror to GitHub / mirror (push) Failing after 38s Details Build Release Binaries / build-amd64 (push) Failing after 3m32s Details	2026-04-09 08:10:22 +04:00
Siavash Sameni	faec332a8c	fix: remove panic::catch_unwind from nativeStartSignaling — stack overflow on Android Some checks failed Mirror to GitHub / mirror (push) Failing after 42s Details Build Release Binaries / build-amd64 (push) Failing after 3m28s Details	2026-04-09 08:04:47 +04:00
Siavash Sameni	fe9ae276dc	fix: move all crypto/network work to spawned 8MB thread — Android stack too small Some checks failed Mirror to GitHub / mirror (push) Failing after 37s Details Build Release Binaries / build-amd64 (push) Failing after 3m25s Details	2026-04-09 07:16:54 +04:00
Siavash Sameni	4fbf6770c4	fix: Android signal thread stack overflow + add version marker to UI Some checks failed Mirror to GitHub / mirror (push) Failing after 40s Details Build Release Binaries / build-amd64 (push) Failing after 3m47s Details - Spawn signaling on dedicated thread with 4MB stack instead of using Android's IO dispatcher thread (insufficient stack for tokio + QUIC) - Add "direct-call-v1" version marker to home screen subtitle Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-09 07:10:07 +04:00
Siavash Sameni	30a893a73f	fix: remove duplicate TextAlign import causing Android build failure Some checks failed Mirror to GitHub / mirror (push) Failing after 38s Details Build Release Binaries / build-amd64 (push) Failing after 3m34s Details	2026-04-09 06:54:45 +04:00
Siavash Sameni	d46f3b1deb	fix: show more Gradle output in build log for debugging Some checks failed Mirror to GitHub / mirror (push) Failing after 36s Details Build Release Binaries / build-amd64 (push) Failing after 3m55s Details	2026-04-09 06:48:14 +04:00
Siavash Sameni	0d3f0d4dcb	feat: Android UI for direct 1:1 calling Some checks failed Mirror to GitHub / mirror (push) Failing after 36s Details Build Release Binaries / build-amd64 (push) Failing after 3m51s Details - Mode toggle: "Room" vs "Direct Call" tabs on pre-connection screen - Direct Call mode: Register button → registers on relay signal channel - After registration: shows fingerprint dial pad + incoming call panel - Incoming call: green Accept / red Reject buttons with caller info - Ringing state display while waiting for callee - CallSetup auto-connects to media room - CallStats extended: sas_code, incoming_call_id/fp/alias fields - CallViewModel: registerForCalls(), placeDirectCall(), answerIncomingCall() Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-09 06:18:07 +04:00
Siavash Sameni	c184d5e1f3	fix: build scripts use fetch+reset instead of pull to avoid ref lock errors Some checks failed Mirror to GitHub / mirror (push) Failing after 37s Details Build Release Binaries / build-amd64 (push) Failing after 3m30s Details git pull fails when refs are stale from concurrent builds. Switch to git gc + git fetch + git reset --hard origin/branch for robustness. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-09 06:07:10 +04:00
Siavash Sameni	5d8e743cbf	feat: Android engine + Kotlin API for direct 1:1 calling Some checks failed Mirror to GitHub / mirror (push) Failing after 35s Details Build Release Binaries / build-amd64 (push) Failing after 3m47s Details Rust engine: - start_signaling(): persistent _signal connection, presence registration - Signal recv loop: handles DirectCallOffer, CallRinging, CallSetup, Hangup - New CallState variants: Registered, Ringing, IncomingCall - Stats expose incoming_call_id, incoming_caller_fp, incoming_caller_alias, sas_code - New EngineCommands: PlaceCall, AnswerCall, RejectCall JNI bridge: - nativeStartSignaling(relay, seed, token, alias) - nativePlaceCall(targetFp) - nativeAnswerCall(callId, mode) Kotlin API (WzpEngine.kt): - startSignaling(relay, seed, token, alias) - placeCall(targetFingerprint) - answerCall(callId, mode) — 0=Reject, 1=AcceptTrusted, 2=AcceptGeneric Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-09 06:02:48 +04:00
Siavash Sameni	6694aebfd9	fix: resolve 0.0.0.0 to connectable address in CallSetup relay_addr Some checks failed Mirror to GitHub / mirror (push) Failing after 35s Details Build Release Binaries / build-amd64 (push) Failing after 3m36s Details When relay listens on 0.0.0.0, derive the actual IP from the client's connection address for the CallSetup message. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-09 05:56:19 +04:00
Siavash Sameni	d27e85ecf2	feat: SAS (Short Authentication String) for call identity verification Some checks failed Mirror to GitHub / mirror (push) Failing after 35s Details Build Release Binaries / build-amd64 (push) Failing after 3m19s Details Derive a 4-digit code from the shared DH secret via HKDF with label "warzone-sas-code". Both peers compute the same code; a MITM relay produces a different one. Users compare verbally during the call. - CryptoSession::sas_code() -> Option<u32> on the trait - ChaChaSession stores and returns the SAS - HKDF derivation in WarzoneKeyExchange::derive_session() - Tests: both peers match, MITM produces different code Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-09 05:48:08 +04:00
Siavash Sameni	39ac181d63	feat: ACL + capacity limit on call rooms, unified fingerprint format Some checks failed Mirror to GitHub / mirror (push) Failing after 37s Details Build Release Binaries / build-amd64 (push) Failing after 3m38s Details - Call rooms (call-*) restricted to the two authorized participants only - Room capacity enforced at 2 for call rooms - Unauthorized clients get immediate connection close - Unified fingerprint format: SHA-256(Ed25519 pub)[:16] as xxxx:xxxx:... Used consistently in signal registration, handshake, and ACL checks Tested: Alice+Bob authorized, attacker rejected with "not authorized" Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-09 05:43:03 +04:00
Siavash Sameni	3351cb6473	feat: direct 1:1 calling via relay signaling (Phase 1) Some checks failed Mirror to GitHub / mirror (push) Failing after 35s Details Build Release Binaries / build-amd64 (push) Failing after 3m43s Details New feature: call someone directly by fingerprint through the relay. - Client connects with SNI "_signal" for persistent signaling - RegisterPresence/RegisterPresenceAck for relay registration - DirectCallOffer routed to target by fingerprint - DirectCallAnswer with AcceptGeneric/AcceptTrusted/Reject modes - Relay creates private room (call-{id}), sends CallSetup to both - Both clients connect to private room for media (existing SFU path) - Hangup forwarding + cleanup on disconnect - Desktop CLI: --signal + --call <fingerprint> for testing - CallRegistry tracks call state (Pending/Ringing/Active/Ended) - SignalHub manages persistent signaling connections Tested: Alice calls Bob by fingerprint, relay routes offer, Bob auto-accepts, both join private room, media flows bidirectionally. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-09 05:35:16 +04:00
Siavash Sameni	54a4d91f3e	docs: add --event-log, --version-check, and federation troubleshooting to admin guide Some checks failed Mirror to GitHub / mirror (push) Failing after 35s Details Build Release Binaries / build-amd64 (push) Failing after 3m32s Details Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-09 04:43:37 +04:00
Siavash Sameni	3b962bd4cb	fix: build scripts use git reset --hard before pull to recover from dirty state Some checks failed Mirror to GitHub / mirror (push) Failing after 1m14s Details Build Release Binaries / build-amd64 (push) Failing after 4m13s Details Cargo.lock changes from Docker builds caused pull conflicts. Now uses reset --hard + clean -fd to guarantee clean state before pulling. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-08 22:13:26 +04:00
Siavash Sameni	1118eac752	fix: re-enable FEC + time-based dedup for federation Some checks failed Mirror to GitHub / mirror (push) Failing after 2m7s Details Build Release Binaries / build-amd64 (push) Has been cancelled Details Restore fec_ratio=0.2 on GOOD profile. Time-based dedup (2s TTL) with payload hash prevents consecutive sender collisions while still catching multi-path duplicates. Verified: 6 consecutive senders across 2 relays, 0 decode errors, 0 drops, FEC active. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-08 22:09:15 +04:00
Siavash Sameni	f935bd69cd	fix: rewrite seq/fec for federation-delivered packets Some checks failed Build Release Binaries / build-amd64 (push) Failing after 2m48s Details Mirror to GitHub / mirror (push) Failing after 4m2s Details - Time-based dedup (2s TTL) replaces fixed-window dedup — consecutive senders with same seq numbers no longer collide - Raw byte forwarding for federation local delivery (no re-serialization) - Jitter buffer resets on large backward seq jumps (>100) - recv_media skips malformed datagrams instead of returning connection-closed - SIGTERM handler for clean QUIC shutdown on wzp-client - JSONL event log infrastructure (--event-log flag) for protocol analysis - FEC disabled on GOOD profile for federation debugging (fec_ratio=0.0) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-08 21:55:06 +04:00
Siavash Sameni	1c684f6b47	fix: rewrite seq/fec for federation-delivered packets Some checks failed Mirror to GitHub / mirror (push) Failing after 35s Details Build Release Binaries / build-amd64 (push) Failing after 1m59s Details Federation media from different senders had conflicting seq numbers, FEC block IDs, and Opus decoder state. The relay now assigns fresh monotonic seq/fec_block/fec_symbol to all federation-delivered packets, ensuring clients see a clean continuous stream regardless of sender changes. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-08 15:48:55 +04:00
Siavash Sameni	c92db7e9b7	fix: preserve original relay label through multi-hop presence propagation Some checks failed Mirror to GitHub / mirror (push) Failing after 35s Details Build Release Binaries / build-amd64 (push) Failing after 7m26s Details When propagating GlobalRoomActive to other peers, use tagged participants (with relay_label set to the originating relay) instead of the raw untagged participants. This shows "Relay C" instead of "Relay B" when C's participants are forwarded through hub B to A. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-08 15:34:22 +04:00
Siavash Sameni	c3bd657224	fix: FEC decoder resets stale blocks — fixes consecutive federation connects Some checks failed Mirror to GitHub / mirror (push) Failing after 36s Details Build Release Binaries / build-amd64 (push) Failing after 2m0s Details When a new sender reuses the same block_id values as a previous sender, the FEC decoder was silently dropping all data because blocks were marked as "already decoded". Now blocks older than 2 seconds are automatically reset when new data arrives for them. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-08 15:26:00 +04:00
Siavash Sameni	8b79cdc6fc	fix: dedup filter collision between different senders + build scripts default --pull Some checks failed Mirror to GitHub / mirror (push) Failing after 35s Details Build Release Binaries / build-amd64 (push) Failing after 1m53s Details - Dedup key now includes source peer fingerprint hash, preventing packets from different senders with same room+seq from being dropped as duplicates (was silently killing all multi-hop audio) - Build scripts default to --pull (use --no-pull to skip) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-08 15:18:52 +04:00
Siavash Sameni	2eab56beec	fix: federation presence dedup, stale cleanup, and Android SIGSEGV crash Some checks failed Mirror to GitHub / mirror (push) Failing after 29s Details Build Release Binaries / build-amd64 (push) Failing after 1m57s Details - Deduplicate remote participants by fingerprint in all merge sites (canonical == raw room name caused double-lookup, doubling every remote participant) - GlobalRoomInactive now propagates updated participant list to other peers (hub relay B was not informing A when C's participants left) - Add 15-second stale presence sweeper that purges remote participants from peers that stop sending data (safety net for QUIC timeout delays) - Add @Synchronized to WzpEngine.getStats/stopCall/destroy to prevent TOCTOU race between stats polling coroutine and engine teardown (SIGSEGV) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-08 15:07:59 +04:00
Siavash Sameni	7dadc1ddd6	fix: default room 'general', cap auto codec at 24k Some checks failed Mirror to GitHub / mirror (push) Failing after 36s Details Build Release Binaries / build-amd64 (push) Failing after 1m51s Details - Android default room changed from 'android' to 'general' - Relay choose_profile capped at GOOD (Opus 24k) — studio tiers (32k/48k/64k) cause high packet loss on federation paths due to larger datagrams exceeding path MTU. Will re-enable after MTU discovery is implemented. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-08 14:41:12 +04:00
Siavash Sameni	be0441295a	fix: read git hash outside Docker for Linux build ntfy notification Some checks failed Mirror to GitHub / mirror (push) Failing after 39s Details Build Release Binaries / build-amd64 (push) Failing after 2m1s Details The hash was read inside Docker (/build/source) where .git doesn't exist. Now reads from $BASE_DIR/data/source before Docker runs. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-08 14:32:03 +04:00
Siavash Sameni	b9f4e7f102	feat: include git hash in ntfy build notifications + MTU PRD Some checks failed Mirror to GitHub / mirror (push) Failing after 29s Details Build Release Binaries / build-amd64 (push) Has been cancelled Details ntfy messages now show: "WZP Linux [abc1234] ready!" and "WZP Android [abc1234] done! APK: url" so you can verify which commit was built without checking relay version remotely. Also added PRD-mtu-discovery.md for QUIC Path MTU Discovery. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-08 14:26:13 +04:00

1 2 3 4 5

227 Commits