diff --git a/.specstory/history/2026-01-21_08-11Z-check-files-py-small-output-error-handling.md b/.specstory/history/2026-01-21_08-11Z-check-files-py-small-output-error-handling.md
index 5b3aae8..bb5edee 100644
--- a/.specstory/history/2026-01-21_08-11Z-check-files-py-small-output-error-handling.md
+++ b/.specstory/history/2026-01-21_08-11Z-check-files-py-small-output-error-handling.md
@@ -17885,3 +17885,576 @@ So when the progress output makes it hard or impossible to type in the encode wi

---

+_**User**_
+
+I'm re-evaluating here.
+1. I think we'll go back to NVENC H.265 for now. I'll use NVENC AV1 when I have 40-series GPUs or newer. Copy the encode settings from @encode_VOD.py
+2. The hosts are going to need to have their GPUs specified. The Unraid machines don't have valid GPUs, so they can be excluded completely. Pyro and PostIrony are gpu0; RenderScrap is gpu0 AND gpu1, so it needs to be sent an SSH job per GPU.
+3. For HEVC MP4 output, we won't need to use TS containers anymore, right? Can we use MP4 containers?
+
+---
+
+_**Agent (model default, mode Agent)**_
+
Thought Process + +The user is re-evaluating the distributed encode setup with several changes: + +1. **Switch from AV1 (libaom) to NVENC H.265** - Copy encode settings from encode_VOD.py +2. **Host/GPU mapping**: + - Unraid machines (GuiltsCurse, Godzilla) - exclude completely (no valid GPUs) + - Pyro, PostIrony - gpu0 + - RenderScrap - gpu0 AND gpu1 (needs 2 SSH jobs, one per GPU) + So the "hosts" concept becomes host+GPU pairs; RenderScrap appears twice (as RenderScrap:0 and RenderScrap:1 or similar) +3. **Container**: Use MP4 instead of TS for HEVC output (no need for mpegts pipe/segments) + +I need to: +1. Look at encode_VOD.py for the NVENC HEVC encode settings +2. Understand current ffmpeg_distributed.py and encode_VOD_pyro.py - how hosts are used, how we'd add GPU per host and exclude some hosts +3. Confirm MP4 vs TS - for NVENC HEVC to pipe we might still need a container; let me check. Actually for local encoding (not pipe), we can output mp4 directly. The current design: reader (ffmpeg -ss -t -i input -c copy) outputs to pipe, encoder reads pipe and writes to segment file. So we have pipe between reader and encoder. For NVENC we're running encoder on remote - so we have: local reader -> pipe -> ssh -> remote ffmpeg (encoder) -> segment file. The segment file format: we were using mpegts because libaom-av1 had issues with matroska to pipe. For NVENC HEVC, we can use: + - Pipe: could use mpegts or raw NAL or ... Actually ffmpeg NVENC can output to pipe as mp4? Let me think. Typically for pipe you use a streaming format. MP4 is not great for streaming/pipe because it needs seeking for moov at end. So we might still use mpegts for the pipe and then... or we could use mp4 for segments if we're writing to file on the remote (encoder writes to file, not to pipe). So: reader outputs to pipe (mpegts or similar), encoder reads pipe and writes to **file**. The segment file on disk can be .mp4 if the encoder outputs HEVC in MP4. 
So yes - we can have segment files as .mp4. The concat step: ffmpeg concat demuxer supports a list of files. For MP4, concat demuxer works but often re-muxes. So we'd have segment files 00000000.mp4, 00000001.mp4, ... and concat them. So the user is right - we don't need TS containers for the segment files; we can use MP4. For the **pipe** between reader and encoder we might still use a streaming format (mpegts or raw). Let me check - if we use NVENC with -f mp4 to pipe, it might not work well because mp4 needs to be seekable for the moov. So the pipe format could stay mpegts (or we use hevc in mpegts) and the encoder decodes that and re-encodes to HEVC and outputs to .mp4 file. So segments = .mp4, concat list = .mp4 files. Good. + +I'll read the relevant files to build the plan. +
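A minimal sketch of that concat-list step (illustrative names only — `write_concat_list` is not the script's actual code; `ffmpeg_distributed.py` builds the list inline):

```python
from pathlib import Path

def write_concat_list(tmp_dir: str) -> str:
    """Write an ffmpeg concat-demuxer list of the .mp4 segments in tmp_dir.

    Each line has the form: file 'path'. The concat step can then run
    ffmpeg -f concat -safe 0 -i <list> -c copy to stitch the segments
    together without re-encoding.
    """
    segments = sorted(Path(tmp_dir).glob("*.mp4"))
    list_path = Path(tmp_dir) / "output_segments.txt"
    list_path.write_text("\n".join(f"file '{p}'" for p in segments))
    return str(list_path)
```

Sorting the glob matters: the zero-padded segment names (`00000000.mp4`, `00000001.mp4`, …) make lexicographic order equal playback order.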
+ +--- + + +Checking encode_VOD.py settings and how the distributed pipeline uses hosts and containers. + + +--- + + +
Tool use: **read_file_v2** • Read file: r:\Videos\mux\encode_VOD.py + +
+
+ +--- + + +
+Tool use: **ripgrep_raw_search** + + +Parameters: + +```json +{ + "pattern": "nvenc|hevc|h265|-c:v|preset|bitrate", + "path": "r:\\Videos\\mux\\encode_VOD.py", + "caseInsensitive": false +} +``` +Result: + +```json +{ + "success": { + "pattern": "nvenc|hevc|h265|-c:v|preset|bitrate", + "path": "r:\\Videos\\mux\\encode_VOD.py", + "outputMode": "content", + "workspaceResults": { + "r:\\Videos\\mux": { + "content": { + "matches": [ + { + "file": "encode_VOD.py", + "matches": [ + { + "lineNumber": 207, + "content": " '-c:v', 'hevc_nvenc'," + }, + { + "lineNumber": 209, + "content": " '-preset', 'p7'," + } + ] + } + ], + "totalLines": 2, + "totalMatchedLines": 2 + } + } + } + } +} +``` +Status: **completed** + + +
+
+ +--- + + +
Tool use: **read_file_v2** • Read file: r:\Videos\mux\ffmpeg_distributed.py + +
+
+ +--- + + +
Tool use: **read_file_v2** • Read file: r:\Videos\mux\encode_VOD_pyro.py + +
+
+ +--- + +
Thought Process + +I need more of encode_VOD.py to get the full NVENC HEVC command (preset, bitrate, etc.) and how the output is produced. +
+ +--- + + + + + +--- + + +
Tool use: **read_file_v2** • Read file: r:\Videos\mux\encode_VOD.py + +
+
+ +--- + + +
+Tool use: **ripgrep_raw_search** + + +Parameters: + +```json +{ + "pattern": "\\.ts|\\.mp4|output_path|segment|concat", + "path": "r:\\Videos\\mux\\ffmpeg_distributed.py", + "caseInsensitive": false +} +``` +Result: + +```json +{ + "success": { + "pattern": "\\.ts|\\.mp4|output_path|segment|concat", + "path": "r:\\Videos\\mux\\ffmpeg_distributed.py", + "outputMode": "content", + "workspaceResults": { + "r:\\Videos\\mux": { + "content": { + "matches": [ + { + "file": "ffmpeg_distributed.py", + "matches": [ + { + "lineNumber": 89, + "content": "def _segment_valid(segment_path: str, expected_duration_sec: float, tolerance_sec: float = 0.1) -> bool:" + }, + { + "lineNumber": 90, + "content": " \"\"\"Return True if ffprobe succeeds on segment and duration matches expected within tolerance (like check_files.py).\"\"\"" + }, + { + "lineNumber": 92, + "content": " ['ffprobe', '-v', 'error', '-show_entries', 'format=duration', '-of', 'csv=p=0', segment_path]," + }, + { + "lineNumber": 106, + "content": "def _build_segments(keyframe_times: List[float], segment_seconds: float, total_duration: float) -> List[Tuple[float, float]]:" + }, + { + "lineNumber": 107, + "content": " \"\"\"Build (start_sec, end_sec) segments at keyframe boundaries. 
When keyframe probe fails, use fixed segment_seconds.\"\"\"" + }, + { + "lineNumber": 111, + "content": " # No keyframes: split at fixed intervals so we still get multiple segments" + }, + { + "lineNumber": 112, + "content": " segments = []" + }, + { + "lineNumber": 115, + "content": " end = min(start + segment_seconds, total_duration)" + }, + { + "lineNumber": 116, + "content": " segments.append((start, end))" + }, + { + "lineNumber": 118, + "content": " return segments" + }, + { + "lineNumber": 119, + "content": " segments = []" + }, + { + "lineNumber": 123, + "content": " end_target = start + segment_seconds" + }, + { + "lineNumber": 131, + "content": " segments.append((start, end))" + }, + { + "lineNumber": 133, + "content": " return segments" + }, + { + "lineNumber": 354, + "content": "def encode(hosts: List[str], input_file: str, output_file: str, segment_seconds: float = 60, remote_args: str = '', concat_args: str = '', tmp_dir: str = None, keep_tmp=False, resume=False, copy_input=False, probe_host: str = None, probe_path: str = None, remote_ffmpeg_path: str = None):" + }, + { + "lineNumber": 357, + "content": " tmp_dir = tmp_dir or 'ffmpeg_segments_'+md5(input_file.encode()).hexdigest()" + }, + { + "lineNumber": 361, + "content": " pass # previous job: resume and re-queue failed segments" + }, + { + "lineNumber": 368, + "content": " segments = _build_segments(keyframe_times, segment_seconds, total_duration)" + }, + { + "lineNumber": 370, + "content": " if not segments:" + }, + { + "lineNumber": 371, + "content": " tqdm.write(f'No segments for {input_file} (keyframes={len(keyframe_times)}, duration={total_duration}). 
Check ffprobe.', file=stderr)" + }, + { + "lineNumber": 374, + "content": " MIN_SEGMENT_BYTES = 1024 # smaller = failed/corrupt segment, re-encode" + }, + { + "lineNumber": 378, + "content": " for i, (start_sec, end_sec) in enumerate(segments):" + }, + { + "lineNumber": 380, + "content": " output_path = f'{tmp_dir}/{i:08d}.ts'" + }, + { + "lineNumber": 381, + "content": " if isfile(output_path):" + }, + { + "lineNumber": 383, + "content": " if getsize(output_path) < MIN_SEGMENT_BYTES:" + }, + { + "lineNumber": 384, + "content": " unlink(output_path)" + }, + { + "lineNumber": 386, + "content": " elif not _segment_valid(output_path, duration_sec, DURATION_TOLERANCE_SEC):" + }, + { + "lineNumber": 387, + "content": " unlink(output_path)" + }, + { + "lineNumber": 393, + "content": " task_queue.put(Task(start_sec, duration_sec, output_path, split(remote_args)))" + }, + { + "lineNumber": 397, + "content": " tqdm.write(f'[2/4] Removed {removed} bad segment(s) (ffprobe failed or duration mismatch), re-queued.', file=stderr)" + }, + { + "lineNumber": 399, + "content": " tqdm.write('All segments already done (resume).', file=stderr)" + }, + { + "lineNumber": 401, + "content": " tqdm.write(f'[2/4] Queued {n_tasks} segments (keyframes={len(keyframe_times)}, duration={total_duration:.1f}s)', file=stderr)" + }, + { + "lineNumber": 403, + "content": " dprint(f'Segments: {len(segments)} total, {n_tasks} tasks queued')" + }, + { + "lineNumber": 405, + "content": " tqdm.write(f'[3/4] Encoding segments on {len(hosts)} host(s)...', file=stderr)" + }, + { + "lineNumber": 471, + "content": " list_path = f'{tmp_dir}/output_segments.txt'" + }, + { + "lineNumber": 473, + "content": " f.write('\\n'.join([f\"file '{fpath}'\" for fpath in sorted(glob(f'{tmp_dir}/*.ts'))]))" + }, + { + "lineNumber": 475, + "content": " tqdm.write('[4/4] Concatenating segments and muxing with audio...', file=stderr)" + }, + { + "lineNumber": 476, + "content": " concat_extra = ['-stats_period', '5'] if verbose 
else []" + }, + { + "lineNumber": 477, + "content": " concat_cmd = [" + }, + { + "lineNumber": 478, + "content": " 'ffmpeg', *concat_extra, '-i', input_file," + }, + { + "lineNumber": 479, + "content": " '-f', 'concat', '-safe', '0', '-i', list_path," + }, + { + "lineNumber": 483, + "content": " *split(concat_args)," + }, + { + "lineNumber": 486, + "content": " tqdm.write('Concat ffmpeg: ' + ' '.join(join([x]) for x in concat_cmd), file=stderr)" + }, + { + "lineNumber": 487, + "content": " with TqdmAbsolute(desc='concatenating output segments') as bar:" + }, + { + "lineNumber": 491, + "content": " ffmpeg = FFMPEGProc(concat_cmd, update_callback=upd, echo_stderr=verbose)" + }, + { + "lineNumber": 502, + "content": " parser = argparse.ArgumentParser(description='Splits a file into segments and processes them on multiple hosts in parallel using ffmpeg over SSH.')" + }, + { + "lineNumber": 506, + "content": " parser.add_argument('concat_args', default='', help='Arguments to pass to the local ffmpeg concatenating the processed video segments and muxing it with the original audio/subs/metadata. Mainly useful for audio encoding options, or \"-an\" to get rid of it.')" + }, + { + "lineNumber": 507, + "content": " parser.add_argument('-s', '--segment-length', type=float, default=10, help='Segment length in seconds.')" + }, + { + "lineNumber": 509, + "content": " parser.add_argument('-k', '--keep-tmp', action='store_true', help='Keep temporary segment files instead of deleting them on successful exit.')" + }, + { + "lineNumber": 510, + "content": " parser.add_argument('-r', '--resume', action='store_true', help='Don\\'t split the input file again, keep existing segments and only process the missing ones.')" + }, + { + "lineNumber": 512, + "content": " parser.add_argument('-c', '--copy-input', action='store_true', help='Don\\'t (losslessly) re-encode input while segmenting. Only use this if your input segments frame-perfectly with \"-c:v copy\" (i.e. 
it has no B-frames)')" + }, + { + "lineNumber": 520, + "content": " segment_seconds=args.segment_length," + }, + { + "lineNumber": 522, + "content": " concat_args=args.concat_args," + } + ] + } + ], + "totalLines": 53, + "totalMatchedLines": 53 + } + } + } + } +} +``` +Status: **completed** + + +
+
+ +--- + +
Thought Process + +Now I have a clear picture. + +**1. NVENC HEVC settings from encode_VOD.py (lines 201-226):** +- `-c:v hevc_nvenc` +- `-gpu`, gpu (GPU index) +- `-preset p7`, `-tune hq` +- `-rc vbr`, `-rc-lookahead 32` +- `-spatial-aq 1`, `-aq-strength 15` +- `-cq 0`, `-b:v 10000k`, `-maxrate 10000k`, `-bufsize 20000k` +- `-c:a copy`, `-map 0`, then metadata for audio labels, then output + +For distributed we won't have audio in segments (segments are video-only), so remote args would be something like: +`-c:v hevc_nvenc -gpu 0 -preset p7 -tune hq -rc vbr -rc-lookahead 32 -spatial-aq 1 -aq-strength 15 -cq 0 -b:v 10000k -maxrate 10000k -bufsize 20000k -an` +And we need to pass the GPU index per worker. + +**2. Host/GPU model:** +- Current: hosts = list of SSH host strings, one thread per host. +- New: "workers" = list of (host, gpu_id). So: + - Pyro -> ("Pyro", 0) + - PostIrony -> ("PostIrony", 0) + - RenderScrap -> ("RenderScrap", 0) and ("RenderScrap", 1) -> 2 workers + - GuiltsCurse, Godzilla -> excluded + +So we need to change from `hosts: List[str]` to something like `workers: List[Tuple[str, int]]` where each entry is (ssh_host, gpu_index). The TaskThread would then need to receive (host, gpu_id) and pass -gpu gpu_id in the encoder command. The display name could be f"{host}:gpu{gpu_id}" or "RenderScrap:0", "RenderScrap:1". + +**3. MP4 vs TS:** +- Currently: reader outputs mpegts to pipe, encoder reads mpegts from pipe and writes **.ts** segment files, concat list is *.ts. +- For HEVC we can: keep reader->pipe as mpegts (or use hevc in mpegts for pipe - actually the reader does -c:v copy so it's copying the input codec; if input is HEVC we'd be piping HEVC. For simplicity we could keep reader as mpegts - re-encode to mpegts for pipe so the encoder gets a known format. Actually the current reader does -c:v copy -f mpegts so it's copying video codec (could be hevc) into mpegts container to pipe. So decoder on the other side reads mpegts. 
For NVENC we'd have: reader (mpegts pipe) -> remote ffmpeg -f mpegts -i pipe: -c:v hevc_nvenc -gpu N ... -f mp4 segment.mp4. So encoder outputs to .mp4 file. Yes! +- Segment extension: .mp4 instead of .ts +- Concat list: glob *.mp4, output_segments.txt lists .mp4 files +- Pipe format: can stay mpegts (reader) -> decoder on remote; encoder writes to .mp4. So we need to change: output_path from .ts to .mp4, encoder -f mpegts pipe:1 to writing to file (we're already writing to file with stdout=outfile). So the remote encoder command currently is: ffmpeg -f mpegts -i pipe: [args] -f mpegts pipe:1 and we write that to a file. So we're writing mpegts to the segment file. To write MP4 we'd do: ffmpeg -f mpegts -i pipe: [args] -f mp4 segment.mp4 (output to file, not pipe). So we need to change the remote command so it writes to a file path. But wait - the way it works now is: we run ssh host "ffmpeg ... pipe:1" and we redirect that to a local file (outfile). So the segment file is written on the **controller** (Windows) by the SSH stdout. So the encoder on the remote outputs to stdout (pipe:1) and we capture that. So the segment file is actually written locally. So the format of the segment file is whatever we pass to -f. So we could do -f mp4 and the remote would output mp4 to stdout and we'd write it to segment.mp4. So yes: change -f mpegts to -f mp4 for the encoder output, and change segment extension from .ts to .mp4. The concat step then uses *.mp4. Good. + +One caveat: MP4 to stdout can be tricky because MP4 typically wants to seek to write the moov at the end. Some encoders support fragmented MP4 (e.g. -movflags frag_keyframe+empty_moov) for streaming. Let me check - for hevc_nvenc output to pipe, we might need -movflags frag_keyframe+empty_moov so the mp4 can be written sequentially. I'll add that to the plan. + +Creating the plan. +
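The controller-side pattern described here — the remote ffmpeg emits its output container to stdout (`pipe:1`) and the controller captures that stream into a local segment file — looks roughly like this (a hedged sketch; both function names are hypothetical):

```python
import subprocess

def build_ssh_cmd(host: str, remote_cmd: list) -> list:
    # Remote workers get an ssh prefix; localhost runs the command directly.
    return list(remote_cmd) if host == "localhost" else ["ssh", host, *remote_cmd]

def run_remote_encode(host: str, remote_cmd: list, segment_path: str) -> int:
    # The remote process writes its output container to stdout (pipe:1);
    # the controller redirects that stream into the local segment file,
    # so the segment ends up on the controller's disk, not the worker's.
    with open(segment_path, "wb") as out:
        return subprocess.run(build_ssh_cmd(host, remote_cmd), stdout=out).returncode
```

This is why the segment file's container format is whatever the remote `-f` flag says: the worker never touches the filesystem, it only streams bytes back over SSH.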
+ +--- + + +Checking how the encoder writes the segment (pipe vs file) and whether MP4 to stdout needs movflags: + + +--- + + +
+ Tool use: **web_search** • Searched web `undefined` • **1** results + +| Title | URL | Chunk | +|-------|-----|-------| +| Web search results for query: ffmpeg hevc_nvenc output mp4 to pipe movflags frag_keyframe | | Links:
1. [nanake / ffmpeg-hevc-encode-nvenc.md](https://gist.github.com/nanake/e5e747f386904911371eee25277c81de)
2. [devjj / ffmpeg-hevc-encode-nvenc.md](https://gist.github.com/devjj/da92e752b2a7ac9371492af4223a18d6/revisions)
3. [Create mp4 file from raw h264 using a pipe instead of files](https://stackoverflow.com/questions/55698581/create-mp4-file-from-raw-h264-using-a-pipe-instead-of-files)

4. [Piping MP4 from FFmpeg](https://www.jaburjak.cz/posts/ffmpeg-pipe-mp4/)
5. [How to output fragmented mp4 with ffmpeg?](https://www.exchangetuts.com/how-to-output-fragmented-mp4-with-ffmpeg-1639584787079460)

# FFmpeg HEVC_NVENC Output to MP4 Pipe with Fragmented MP4

## Key Solution: Use `movflags` for Pipe Output

To output HEVC-encoded video to a pipe as MP4, you need to use fragmented MP4 format because standard MP4 requires seeking, which pipes don't support.[4]

## Required Command Structure

Use the `-movflags` option with fragmentation flags:[4][5]

```
ffmpeg -i input -c:v hevc_nvenc -f mp4 -movflags frag_keyframe+empty_moov -
```

## movflags Options Explained

- **`frag_keyframe`**: Creates fragmented output where each keyframe starts a new fragment[4][5]
- **`empty_moov`**: Places the `moov` atom at the beginning and makes it empty, followed by fragmented `moof` and `mdat` atoms[4][5]

## Why This Is Necessary

Standard MP4 files use atoms with size headers that must be calculated retroactively by seeking backward—impossible with pipes. Fragmented MP4 splits tracks into small fragments that FFmpeg can keep in memory and output sequentially without seeking.[4]

## Trade-off

Fragmented MP4s have reduced compatibility. Some players (like Windows 10's built-in video player) may report zero duration and have unusable seek bars, though playback typically works.[4]

For HEVC NVENC encoding specifically, apply these `movflags` to your hevc_nvenc command as shown in the structure above.[1][2] | + +
+
+ +--- + + +
Tool use: **read_file_v2** • Read file: r:\Videos\mux\ffmpeg_distributed.py + +
+
+ +--- + +
Thought Process + +So for MP4 to pipe we need -movflags frag_keyframe+empty_moov. The concat demuxer in ffmpeg can concat fragmented MP4s - I believe it works. So the plan: + +1. **encode_VOD_pyro.py** + - Change DISTRIBUTED_HOSTS_DEFAULT to a list of (host, gpu_id) "workers": Pyro:0, RenderScrap:0, RenderScrap:1, PostIrony:0 (no Unraid). + - Change DISTRIBUTED_REMOTE_ARGS_DEFAULT to NVENC HEVC from encode_VOD.py (without -gpu, that will be injected per worker). + - Pass workers into ffmpeg_distributed.encode() instead of hosts. + +2. **ffmpeg_distributed.py** + - Change to accept workers: List[Tuple[str, int]] or List of (host, gpu_id). Each thread gets (host, gpu_id). + - TaskThread: accept (host, gpu_id), store both; display as "host:gpuN"; when building encoder_cmd, inject -gpu gpu_id and use remote_ffmpeg_path for host. + - Segment extension: .ts -> .mp4 everywhere (output_path, glob, list_path). + - Reader: keep -f mpegts pipe:1 (unchanged). + - Encoder: -f mpegts -i pipe: [remote_args with -gpu N] -f mp4 -movflags frag_keyframe+empty_moov pipe:1 (so we write to outfile which gets .mp4 bytes). So remote_args should NOT contain -gpu; we add -gpu {gpu_id} per worker. + - Concat: glob *.mp4, list_path lists .mp4 files; concat step stays the same (map 1:v etc). + +3. **NVENC remote args** (from encode_VOD.py, video-only for segments): + - -c:v hevc_nvenc -preset p7 -tune hq -rc vbr -rc-lookahead 32 -spatial-aq 1 -aq-strength 15 -cq 0 -b:v 10000k -maxrate 10000k -bufsize 20000k -an + - We add -gpu {gpu_id} in the distributed code when building the encoder command. + +Let me also confirm: for concat of fragmented MP4 segments, does ffmpeg concat demuxer work? Yes - the concat demuxer just concatenates the streams, so as long as all segments are HEVC and we use -c:v copy for concat it should work. + +I'll add one more thing: the Task structure currently has ffmpeg_args as a list (from split(remote_args)). We need to inject -gpu gpu_id. 
So we could either (a) pass gpu_id in Task and have the thread add it when building the command, or (b) have the thread add -gpu self._gpu_id when building encoder_cmd. Option (b) is cleaner - Task stays as is, TaskThread has _gpu_id and when it builds encoder_cmd it does [ffmpeg_bin, '-f', 'mpegts', '-i', 'pipe:', '-gpu', str(self._gpu_id), *task.ffmpeg_args, '-f', 'mp4', '-movflags', 'frag_keyframe+empty_moov', 'pipe:1']. So remote_args in the config should not include -gpu. + +Creating the plan. +
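Option (b) can be sketched as a small command builder (illustrative only — the real change lands inside `TaskThread`, but the argument order and flags are what the plan proposes):

```python
def build_encoder_cmd(ffmpeg_bin, gpu_id, encode_args):
    """Remote encoder command: read mpegts from stdin, encode, emit MP4 on
    stdout. Plain MP4 needs a seekable output to write the trailing moov
    atom, so fragmented MP4 (frag_keyframe+empty_moov) is used for the pipe.
    """
    return [
        ffmpeg_bin, "-f", "mpegts", "-i", "pipe:",
        "-gpu", str(gpu_id),  # injected per worker, not part of shared remote_args
        *encode_args,
        "-f", "mp4", "-movflags", "frag_keyframe+empty_moov", "pipe:1",
    ]
```

Keeping `-gpu` out of the shared `remote_args` string means the same config line serves Pyro, PostIrony, and both RenderScrap workers.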
+ +--- + + +Creating the plan. + + +--- + + +
+Tool use: **create_plan** + + +Parameters: + +```json +{ + "plan": "# NVENC HEVC distributed encode with per-host GPU and MP4 segments\n\n## 1. Copy NVENC HEVC settings from encode_VOD.py\n\nFrom [encode_VOD.py](r:\\Videos\\mux\\encode_VOD.py) (lines 201–226), the local encode uses:\n\n- `-c:v hevc_nvenc` with `-gpu` (index)\n- `-preset p7`, `-tune hq`\n- `-rc vbr`, `-rc-lookahead 32`\n- `-spatial-aq 1`, `-aq-strength 15`\n- `-cq 0`, `-b:v 10000k`, `-maxrate 10000k`, `-bufsize 20000k`\n\nFor **distributed segments** (video-only, no audio): use the same video options and `-an`. The `-gpu` index will be injected per worker in the distributed code, not in the shared remote args string.\n\n**Proposed default remote args** (no `-gpu`; added per worker):\n\n```text\n-c:v hevc_nvenc -preset p7 -tune hq -rc vbr -rc-lookahead 32 -spatial-aq 1 -aq-strength 15 -cq 0 -b:v 10000k -maxrate 10000k -bufsize 20000k -an\n```\n\n---\n\n## 2. Workers = (host, gpu_id); exclude Unraid; RenderScrap = 2 workers\n\n**Current:** `hosts: List[str]` — one thread per host (Pyro, RenderScrap, root@GuiltsCurse, PostIrony, root@Godzilla).\n\n**New:** `workers: List[Tuple[str, int]]` — each entry is `(ssh_host, gpu_index)`. One thread per (host, gpu).\n\n- **Pyro:** gpu 0 → `(\"Pyro\", 0)`\n- **PostIrony:** gpu 0 → `(\"PostIrony\", 0)`\n- **RenderScrap:** gpu 0 and gpu 1 → `(\"RenderScrap\", 0)`, `(\"RenderScrap\", 1)`\n- **GuiltsCurse, Godzilla:** not in the list (Unraid, no valid GPUs)\n\nSo the default workers list becomes:\n\n```python\nDISTRIBUTED_WORKERS_DEFAULT = [\n (\"Pyro\", 0),\n (\"RenderScrap\", 0),\n (\"RenderScrap\", 1),\n (\"PostIrony\", 0),\n]\n```\n\n- [encode_VOD_pyro.py](r:\\Videos\\mux\\encode_VOD_pyro.py): replace `DISTRIBUTED_HOSTS_DEFAULT` with `DISTRIBUTED_WORKERS_DEFAULT`; all call sites that pass “hosts” should pass “workers” and any env (e.g. `DISTRIBUTED_HOSTS`) can be replaced with something like `DISTRIBUTED_WORKERS` (e.g. 
parse `\"Pyro:0,RenderScrap:0,RenderScrap:1,PostIrony:0\"`).\n- [ffmpeg_distributed.py](r:\\Videos\\mux\\ffmpeg_distributed.py): `encode()` and `TaskThread` take workers instead of hosts. Each thread gets `(host, gpu_id)`; display name e.g. `Pyro:gpu0`, `RenderScrap:gpu0`, `RenderScrap:gpu1`, `PostIrony:gpu0`. When building the remote encoder command, add `-gpu str(gpu_id)` (and keep using `remote_ffmpeg_path` for the binary on Unraid if you later add back a non-NVENC path).\n\n---\n\n## 3. MP4 segments and pipe-safe output\n\n**Current:** Reader outputs mpegts to pipe; encoder reads mpegts and writes **.ts** segments; concat uses `*.ts` and `output_segments.txt` listing `.ts` files.\n\n**New (HEVC):**\n\n- **Segment extension:** `.ts` → `.mp4` everywhere: segment paths, resume/validation, glob for concat list, and `output_segments.txt` content.\n- **Encoder output:** Remote encoder currently does `-f mpegts pipe:1` and the controller writes that stream to the segment file. Change to MP4:\n - Use **`-f mp4 -movflags frag_keyframe+empty_moov pipe:1`** so MP4 can be written sequentially over pipe (standard MP4 needs seeking; fragmented MP4 is pipe-safe). Segments will be fragmented MP4 (playback fine; concat demuxer works with them).\n- **Reader:** Keep `-f mpegts pipe:1` (no change).\n- **Concat step:** Glob `*.mp4`, list those in `output_segments.txt`; concat command stays `-f concat -safe 0 -i list_path`, `-map 1:v`, etc. No need for TS anymore.\n\n**Files to touch in ffmpeg_distributed.py:**\n\n- Segment path: `f'{tmp_dir}/{i:08d}.ts'` → `f'{tmp_dir}/{i:08d}.mp4'`\n- Glob for concat: `glob(f'{tmp_dir}/*.ts')` → `glob(f'{tmp_dir}/*.mp4')`\n- Encoder command: after `*task.ffmpeg_args`, add `-gpu str(self._gpu_id)` (from TaskThread’s worker), then `-f mp4 -movflags frag_keyframe+empty_moov pipe:1` instead of `-f mpegts pipe:1`.\n\n---\n\n## 4. 
Implementation summary\n\n| Area | Change |\n|------|--------|\n| **encode_VOD_pyro.py** | `DISTRIBUTED_HOSTS_DEFAULT` → `DISTRIBUTED_WORKERS_DEFAULT` (list of (host, gpu)); `DISTRIBUTED_REMOTE_ARGS_DEFAULT` = NVENC HEVC string above (no `-gpu`); pass workers into distributed encode; optional env `DISTRIBUTED_WORKERS` (e.g. `Pyro:0,RenderScrap:0,RenderScrap:1,PostIrony:0`). |\n| **ffmpeg_distributed.py** | `encode(workers: List[Tuple[str, int]], ...)`; `TaskThread(host, gpu_id, ...)` with `_host_tag()` like `RenderScrap:gpu1`; encoder_cmd includes `-gpu str(gpu_id)` and `-f mp4 -movflags frag_keyframe+empty_moov pipe:1`; segment paths and concat glob/list use `.mp4`. |\n| **CLI** | If ffmpeg_distributed is invoked directly, add a way to pass workers (e.g. `-H host:gpu` repeated, or keep `-H host` and add optional `--gpu` per host; minimal change is to accept workers as the primary interface). |\n\n---\n\n## 5. Data flow (unchanged structure, new types and formats)\n\n```mermaid\nflowchart LR\n subgraph controller [Controller]\n Queue[Task queue]\n Concat[Concat step]\n end\n subgraph workers [Workers]\n W1[\"Pyro:gpu0\"]\n W2[\"RenderScrap:gpu0\"]\n W3[\"RenderScrap:gpu1\"]\n W4[\"PostIrony:gpu0\"]\n end\n Queue --> W1\n Queue --> W2\n Queue --> W3\n Queue --> W4\n W1 --> Segs[.mp4 segments]\n W2 --> Segs\n W3 --> Segs\n W4 --> Segs\n Segs --> Concat\n```\n\n---\n\n## 6. Optional: remote_ffmpeg_path and Unraid\n\nUnraid hosts are no longer in the worker list, so `DISTRIBUTED_REMOTE_FFMPEG_PATH` / `remote_ffmpeg_path` are only relevant if you add non-Unraid hosts that need a custom ffmpeg path. 
No change required for the “exclude Unraid” design; keep the existing parameter so RenderScrap/Pyro/PostIrony can still use default `ffmpeg` on PATH.\n", + "name": "NVENC HEVC distributed workers", + "overview": "Switch distributed encode from AV1 (libaom) to NVENC H.265 using encode_VOD.py settings; model workers as (host, gpu_id) so RenderScrap gets two workers (gpu0 and gpu1), Unraid hosts are excluded; use MP4 for segment and concat with movflags for pipe-safe output." +} +``` +Additional data: + +```json +{ + "reviewData": { + "status": "Requested", + "selectedOption": "none", + "isShowingInput": false + }, + "planUri": "file:///c%3A/Users/Nathan/.cursor/plans/nvenc_hevc_distributed_workers_efd618b8.plan.md", + "hasOpenedEditor": true +} +``` +Result: + +```json +{ + "rejected": {} +} +``` +Status: **completed** + + +
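The `DISTRIBUTED_WORKERS` env format proposed in the plan (e.g. `Pyro:0,RenderScrap:0,RenderScrap:1,PostIrony:0`) could be parsed along these lines — a sketch under the plan's assumptions; the eventual implementation may differ:

```python
def parse_workers(spec):
    """Parse 'host:gpu,host:gpu,...' into (host, gpu_id) pairs.

    Splits on the LAST ':' so a host spec may itself contain ':';
    a bare host with no GPU suffix defaults to GPU 0.
    """
    workers = []
    for part in filter(None, (p.strip() for p in spec.split(","))):
        host, _, gpu = part.rpartition(":")
        if host and gpu.isdigit():
            workers.append((host, int(gpu)))
        else:
            workers.append((part, 0))
    return workers
```

Defaulting bare hosts to GPU 0 keeps the old single-GPU host strings valid as worker specs.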
+
+ +--- + +_**User**_ + +Implement the plan as specified, it is attached for your reference. Do NOT edit the plan file itself. + +To-do's from the plan have already been created. Do not create them again. Mark them as in_progress as you work, starting with the first one. Don't stop until you have completed all the to-dos. + +--- + +_**Agent (model default, mode Agent)**_ + + + +--- + + + +--- + + +
Tool use: **todo_write** • Error parsing todo list data + +
+
+ +--- + diff --git a/encode_VOD_pyro.py b/encode_VOD_pyro.py index c56cc39..65d1a9b 100644 --- a/encode_VOD_pyro.py +++ b/encode_VOD_pyro.py @@ -11,10 +11,34 @@ import shutil import time # Distributed mode requires tqdm and ffmpeg_distributed.py (SSH, Unix select.poll); on Windows use WSL or Linux. -DISTRIBUTED_HOSTS_DEFAULT = ["Pyro", "RenderScrap", "root@GuiltsCurse", "PostIrony", "root@Godzilla"] -DISTRIBUTED_REMOTE_ARGS_DEFAULT = "-c:v libaom-av1 -crf 0 -b:v 9000k -maxrate 9000k -bufsize 18000k -cpu-used 8 -row-mt 1 -an" +# Workers = (ssh_host, gpu_index). Unraid (GuiltsCurse, Godzilla) excluded; RenderScrap has 2 GPUs. +DISTRIBUTED_WORKERS_DEFAULT = [ + ("Pyro", 0), + ("RenderScrap", 0), + ("RenderScrap", 1), + ("PostIrony", 0), +] +DISTRIBUTED_REMOTE_ARGS_DEFAULT = "-c:v hevc_nvenc -preset p7 -tune hq -rc vbr -rc-lookahead 32 -spatial-aq 1 -aq-strength 15 -cq 0 -b:v 9000k -maxrate 9000k -bufsize 18000k -an" DISTRIBUTED_SEGMENT_SECONDS = 60 + +def _parse_workers_env(s): + """Parse DISTRIBUTED_WORKERS e.g. 'Pyro:0,RenderScrap:0,RenderScrap:1,PostIrony:0' -> [(host, gpu_id), ...].""" + out = [] + for part in (s or "").strip().split(","): + part = part.strip() + if not part: + continue + if ":" in part: + host, gpu = part.rsplit(":", 1) + try: + out.append((host.strip(), int(gpu.strip()))) + except ValueError: + pass + else: + out.append((part, 0)) + return out + # ANSI color codes class Colors: PURPLE = '\033[95m' @@ -342,9 +366,9 @@ def encode_dvr(input_file, output_dir, gpu): f"{Colors.RED}Unexpected error encoding {input_path}: {e}{Colors.ENDC}") -def encode_dvr_distributed(input_file, output_dir, hosts, segment_seconds=60, remote_args=None, concat_args="-c:a copy", probe_host=None, probe_path=None, remote_ffmpeg_path=None): - """Encode one file using ffmpeg_distributed (split -> farm -> concat). Segment temp dirs go under script dir/tmp/. 
-    If probe_host and probe_path are set, ffprobe runs there (faster when input is on NAS)."""
+def encode_dvr_distributed(input_file, output_dir, workers, segment_seconds=60, remote_args=None, concat_args="-c:a copy", probe_host=None, probe_path=None, remote_ffmpeg_path=None):
+    """Encode one file using ffmpeg_distributed (split -> farm -> concat). workers = [(host, gpu_id), ...].
+    Segment temp dirs go under script dir/tmp/. If probe_host and probe_path are set, ffprobe runs there (faster when input is on NAS)."""
     input_path = Path(input_file).resolve()
     output_path = (Path(output_dir) / f"{input_path.stem}{input_path.suffix}").resolve()
     if output_path.exists():
@@ -366,11 +390,11 @@ def encode_dvr_distributed(input_file, output_dir, hosts, segment_seconds=60, re
     try:
         os.chdir(output_dir)
         from ffmpeg_distributed import encode as distributed_encode
-        safe_log_info(f"Distributed encode: {input_path} -> {output_path} (hosts: {hosts})")
-        print(f"{Colors.BLUE}Distributed encode (AV1): {input_path.name}{Colors.ENDC}")
+        safe_log_info(f"Distributed encode: {input_path} -> {output_path} (workers: {workers})")
+        print(f"{Colors.BLUE}Distributed encode (HEVC): {input_path.name}{Colors.ENDC}")
         remote_ffmpeg = remote_ffmpeg_path or os.environ.get("DISTRIBUTED_REMOTE_FFMPEG_PATH")
         distributed_encode(
-            hosts,
+            workers,
             str(input_path),
             str(output_path),
             segment_seconds=segment_seconds,
@@ -399,13 +423,14 @@ if __name__ == "__main__":
     output_dir = "output"
     os.makedirs(output_dir, exist_ok=True)
 
-    hosts_str = os.environ.get("DISTRIBUTED_HOSTS")
-    if hosts_str:
-        hosts = [h.strip() for h in hosts_str.split(",") if h.strip()]
+    workers_str = os.environ.get("DISTRIBUTED_WORKERS")
+    if workers_str:
+        workers = _parse_workers_env(workers_str)
     else:
-        hosts = DISTRIBUTED_HOSTS_DEFAULT
-    print(f"{Colors.BLUE}Using hosts: {', '.join(hosts)}{Colors.ENDC}")
-    safe_log_info(f"Distributed mode; hosts: {hosts}")
+        workers = DISTRIBUTED_WORKERS_DEFAULT
+    workers_desc = ", ".join(f"{h}:gpu{g}" for h, g in workers)
+    print(f"{Colors.BLUE}Using workers: {workers_desc}{Colors.ENDC}")
+    safe_log_info(f"Distributed mode; workers: {workers}")
 
     files = [f for f in os.listdir(input_dir) if f.endswith(('.mp4', '.DVR.mp4'))]
     total_files = len(files)
@@ -419,4 +444,4 @@ if __name__ == "__main__":
         input_file = os.path.join(input_dir, file)
         safe_log_info(f"Processing file {i}/{total_files}: {file}")
         print(f"\n{Colors.BLUE}Processing file {i}/{total_files}: {file}{Colors.ENDC}")
-        encode_dvr_distributed(input_file, output_dir, hosts, segment_seconds=DISTRIBUTED_SEGMENT_SECONDS)
\ No newline at end of file
+        encode_dvr_distributed(input_file, output_dir, workers, segment_seconds=DISTRIBUTED_SEGMENT_SECONDS)
\ No newline at end of file
diff --git a/ffmpeg_distributed.py b/ffmpeg_distributed.py
index 765b504..6326745 100644
--- a/ffmpeg_distributed.py
+++ b/ffmpeg_distributed.py
@@ -257,21 +257,23 @@ HOST_COLORS = ['\033[94m', '\033[92m', '\033[93m', '\033[95m', '\033[96m', '\033
 RESET = '\033[0m'
 
 class TaskThread(Thread):
-    def __init__(self, host: str, source_file: str, task_queue: SimpleQueue, bar_pos: int, remote_ffmpeg_path: str = None):
+    def __init__(self, host: str, gpu_id: int, source_file: str, task_queue: SimpleQueue, bar_pos: int, remote_ffmpeg_path: str = None):
         super().__init__()
         self._should_stop = False
         self._host = host
+        self._gpu_id = gpu_id
+        self._host_desc = f"{host}:gpu{gpu_id}"
         self._bar_pos = bar_pos
         self._remote_ffmpeg_path = remote_ffmpeg_path
         self._source_file = source_file
         self._task_queue = task_queue
         self._ffmpeg = None
-        self._bar = TqdmAbsolute(desc=host, position=bar_pos)
+        self._bar = TqdmAbsolute(desc=self._host_desc, position=bar_pos)
         self._current_file = None
 
     def _host_tag(self):
         c = HOST_COLORS[self._bar_pos % len(HOST_COLORS)]
-        return f'{c}{self._host}{RESET}'
+        return f'{c}{self._host_desc}{RESET}'
 
     def stop(self):
         self._should_stop = True
@@ -287,7 +289,7 @@ class TaskThread(Thread):
         last_log = [0.0]  # mutable for progress heartbeat
         def upd(frames, fps, t, duration, speed):
             self._bar.total = duration or 999
-            self._bar.desc = self._host + ': ' + (self._current_file or '')
+            self._bar.desc = self._host_desc + ': ' + (self._current_file or '')
             self._bar.update(t)
             if duration and duration > 0 and (time() - last_log[0]) >= 30:
                 tqdm.write(f' {self._host_tag()}: {self._current_file} {t:.0f}s / {duration:.0f}s ({speed:.1f}x)', file=stderr)
@@ -307,8 +309,9 @@ class TaskThread(Thread):
            ffmpeg_bin = (self._remote_ffmpeg_path or 'ffmpeg') if self._host != 'localhost' else 'ffmpeg'
            encoder_cmd = [
                ffmpeg_bin, '-f', 'mpegts', '-i', 'pipe:',
+                '-gpu', str(self._gpu_id),
                *task.ffmpeg_args,
-                '-f', 'mpegts', 'pipe:1'
+                '-f', 'mp4', '-movflags', 'frag_keyframe+empty_moov', 'pipe:1'
            ]
            if self._host == 'localhost' and sys_platform != 'win32':
                encoder_cmd = ['nice', '-n10', 'ionice', '-c3'] + encoder_cmd
@@ -351,7 +354,7 @@ class TaskThread(Thread):
                 pass
         self._bar.close()
 
-def encode(hosts: List[str], input_file: str, output_file: str, segment_seconds: float = 60, remote_args: str = '', concat_args: str = '', tmp_dir: str = None, keep_tmp=False, resume=False, copy_input=False, probe_host: str = None, probe_path: str = None, remote_ffmpeg_path: str = None):
+def encode(workers: List[Tuple[str, int]], input_file: str, output_file: str, segment_seconds: float = 60, remote_args: str = '', concat_args: str = '', tmp_dir: str = None, keep_tmp=False, resume=False, copy_input=False, probe_host: str = None, probe_path: str = None, remote_ffmpeg_path: str = None):
     input_file = abspath(expanduser(input_file))
     output_file = abspath(expanduser(output_file))
     tmp_dir = tmp_dir or 'ffmpeg_segments_'+md5(input_file.encode()).hexdigest()
@@ -377,7 +380,7 @@ def encode(hosts: List[str], input_file: str, output_file: str, segment_seconds:
         removed = 0
         for i, (start_sec, end_sec) in enumerate(segments):
             duration_sec = end_sec - start_sec
-            output_path = f'{tmp_dir}/{i:08d}.ts'
+            output_path = f'{tmp_dir}/{i:08d}.mp4'
             if isfile(output_path):
                 try:
                     if getsize(output_path) < MIN_SEGMENT_BYTES:
@@ -402,9 +405,9 @@ def encode(hosts: List[str], input_file: str, output_file: str, segment_seconds:
     stderr.flush()
     dprint(f'Segments: {len(segments)} total, {n_tasks} tasks queued')
 
-    tqdm.write(f'[3/4] Encoding segments on {len(hosts)} host(s)...', file=stderr)
+    tqdm.write(f'[3/4] Encoding segments on {len(workers)} worker(s)...', file=stderr)
     stderr.flush()
-    threads = [TaskThread(host, input_file, task_queue, pos, remote_ffmpeg_path) for pos, host in enumerate(hosts, 0)]
+    threads = [TaskThread(host, gpu_id, input_file, task_queue, pos, remote_ffmpeg_path) for pos, (host, gpu_id) in enumerate(workers, 0)]
 
     def stop_all():
         tqdm.write('Stopping all workers (killing ffmpeg/SSH on each host)...', file=stderr)
@@ -470,7 +473,7 @@ def encode(hosts: List[str], input_file: str, output_file: str, segment_seconds:
 
     list_path = f'{tmp_dir}/output_segments.txt'
     with open(list_path, 'w') as f:
-        f.write('\n'.join([f"file '{fpath}'" for fpath in sorted(glob(f'{tmp_dir}/*.ts'))]))
+        f.write('\n'.join([f"file '{fpath}'" for fpath in sorted(glob(f'{tmp_dir}/*.mp4'))]))
 
     tqdm.write('[4/4] Concatenating segments and muxing with audio...', file=stderr)
     concat_extra = ['-stats_period', '5'] if verbose else []
@@ -497,15 +500,33 @@ def encode(hosts: List[str], input_file: str, output_file: str, segment_seconds:
     if not keep_tmp:
         rmtree(tmp_dir)
 
+def _parse_workers(host_specs):
+    """Parse list of 'host' or 'host:gpu' into [(host, gpu_id), ...]."""
+    workers = []
+    for spec in host_specs or []:
+        spec = (spec or "").strip()
+        if not spec:
+            continue
+        if ":" in spec:
+            host, gpu = spec.rsplit(":", 1)
+            try:
+                workers.append((host.strip(), int(gpu.strip())))
+            except ValueError:
+                workers.append((spec, 0))
+        else:
+            workers.append((spec, 0))
+    return workers
+
+
 if __name__ == '__main__':
     import argparse
-    parser = argparse.ArgumentParser(description='Splits a file into segments and processes them on multiple hosts in parallel using ffmpeg over SSH.')
+    parser = argparse.ArgumentParser(description='Splits a file into segments and processes them on multiple workers (host:gpu) in parallel using ffmpeg over SSH.')
     parser.add_argument('input_file', help='File to encode.')
     parser.add_argument('output_file', help='Path to encoded output file.')
-    parser.add_argument('remote_args', help='Arguments to pass to the remote ffmpeg instances. For example: "-c:v libx264 -crf 23 -preset fast"')
-    parser.add_argument('concat_args', default='', help='Arguments to pass to the local ffmpeg concatenating the processed video segments and muxing it with the original audio/subs/metadata. Mainly useful for audio encoding options, or "-an" to get rid of it.')
+    parser.add_argument('remote_args', help='Arguments to pass to the remote ffmpeg instances (e.g. NVENC: -c:v hevc_nvenc -preset p7 ...). -gpu is added per worker.')
+    parser.add_argument('concat_args', default='', help='Arguments to pass to the local ffmpeg concatenating the processed video segments and muxing it with the original audio/subs/metadata.')
     parser.add_argument('-s', '--segment-length', type=float, default=10, help='Segment length in seconds.')
-    parser.add_argument('-H', '--host', action='append', help='SSH hostname(s) to encode on. Use "localhost" to include the machine you\'re running this from. Can include username.', required=True)
+    parser.add_argument('-H', '--host', action='append', help='Worker as host or host:gpu (e.g. -H Pyro:0 -H RenderScrap:0 -H RenderScrap:1).', required=True)
     parser.add_argument('-k', '--keep-tmp', action='store_true', help='Keep temporary segment files instead of deleting them on successful exit.')
     parser.add_argument('-r', '--resume', action='store_true', help='Don\'t split the input file again, keep existing segments and only process the missing ones.')
     parser.add_argument('-t', '--tmp-dir', default=None, help='Directory to use for temporary files. Should not already exist and will be deleted afterwards.')
@@ -513,8 +534,11 @@ if __name__ == '__main__':
     parser.add_argument('-P', '--probe-host', default=None, help='SSH host to run ffprobe on (file must be at --probe-path there). Speeds up [1/4] when input is on slow/NAS path.')
     parser.add_argument('--probe-path', default=None, help='Path to input file as seen on --probe-host (required if -P set).')
     args = parser.parse_args()
+    workers = _parse_workers(args.host)
+    if not workers:
+        parser.error('At least one worker required (e.g. -H Pyro:0 -H RenderScrap:0)')
     encode(
-        args.host,
+        workers,
         args.input_file,
         args.output_file,
         segment_seconds=args.segment_length,