Custom ID for Batch Correlation

In short: Custom id for batch correlation is the practice of tagging every Message Batches request with a unique custom_id so you can match each asynchronous result back to the input that produced it. Because batch results can arrive in any order rather than input order, the custom_id is the only reliable handle for joining outputs to records and for pinpointing exactly which requests failed.

Why custom id for batch correlation exists

Custom id for batch correlation is the discipline that keeps an asynchronous batch from turning into an unmatched pile of answers. When you submit a synchronous request you get its answer back immediately, so there is never any doubt about which input produced which output. A batch breaks that one-to-one immediacy: you hand over thousands of requests at once, the system processes them independently, and the results file you download later can list them in any order. The model never sees a row number, and the API makes no promise to preserve input order, so without a deliberate key you cannot tell which generated summary belongs to which document.

The custom_id field is that key. You assign a unique string to every request in the batch, and the API echoes the exact same string back on the corresponding result. Joining outputs to inputs then becomes a straightforward lookup on that field. For the Claude Certified Architect exam this is an apply-level skill: you are expected to recognise that order is not guaranteed and to reach for custom_id as the correlation mechanism, not to recite its JSON schema.

custom_id: A unique identifier you attach to each request in a Message Batch. It must be 1 to 64 characters and contain only letters, numbers, hyphens, and underscores. The API returns it unchanged on the matching result, making it the join key between outputs and inputs.

Results come back in any order

The single fact that motivates this whole knowledge point is that batch results are unordered. Anthropic's documentation is explicit that results may not match the ordering of the requests when the batch was created, and that you should always use the custom_id field to match results to their requests. The reason is mechanical: each request is processed independently and finishes whenever it finishes, so a short request submitted last can complete before a long request submitted first. The results file simply records them in completion order, which is effectively arbitrary from your point of view.

This is why an architect who tries to zip the results array against the input array by position will silently corrupt their data. The first line of the results file is not the first request; it is just the first one that finished. The only correct join is on custom_id, and the only way that join works is if you assigned the ids carefully when you built the batch. You can confirm the unordered behaviour in Anthropic's batch processing guide.

custom_id as the join key across an unordered batch

Loading diagram...

Inputs and outputs are connected only by the matching custom_id, never by position in the file.

Design the id, do not just generate one

The temptation is to assign throwaway ids like a random UUID or a simple counter. That technically satisfies uniqueness, but it wastes the field. A far better practice is to encode a stable business key from your own data into the custom_id, so the join back to your records needs no separate mapping table. If you are classifying support tickets, use the ticket id. If you are enriching CRM rows, use the row's primary key. If you are summarising documents, use a document identifier, perhaps with a stage suffix when one record fans out into several requests.

Two constraints shape the id you choose. It must be unique within the batch, because a duplicate id makes two results indistinguishable, and it must fit the format: 1 to 64 characters using only letters, numbers, hyphens, and underscores. That rules out raw file paths with slashes, email addresses with at-signs, or anything with spaces, so you sanitise the business key before using it. A meaningful, well-formed id turns result handling into a one-line lookup and, as the next knowledge point shows, makes failure handling almost free.

Unique within the batch: never reuse an id, or two results collide.
Format-safe: 1 to 64 characters, letters, numbers, hyphens, underscores only.
Meaningful: encode a real key from your data so no side mapping is needed.
Stable: derive it from the source record, not from request position.

Worked example

You submit 5,000 contract documents to a batch for clause extraction. When the results file arrives, the lines are in a different order than you submitted, and your first instinct is to read them in sequence.

Imagine you ignored custom_id and assumed the results came back in input order. You would line up the first result against the first contract, the second against the second, and so on. Because the file is ordered by completion, not submission, almost every pairing would be wrong: the clauses extracted from contract 4,812 might be filed under contract 1, quietly poisoning the entire dataset. Worse, the corruption is invisible, since every contract still receives some result.

Now do it correctly. When you build the batch, set each request's custom_id to the contract id, for example contract-4812. When the results stream back, you read each line, take its custom_id, and write the extracted clauses to that exact contract record. Order no longer matters, because you are joining on identity rather than position. A result for contract-4812 lands on contract 4,812 whether it appears first or last in the file.

The same ids then carry into recovery. If 40 of the 5,000 came back with an errored result, you collect their custom_id values, and you instantly have the precise list of contracts to re-examine and resubmit, with no need to diff the inputs against the outputs. One well-chosen field solves both correlation and failure triage at once.

How correlation enables failure handling

Correlation is not only about success; it is the foundation for recovery. Every batch result, whether succeeded, errored, canceled, or expired, carries its custom_id. That means the moment you read the results you can partition them by outcome and, for anything that did not succeed, produce an exact list of which inputs to revisit. Because one request failing never affects the others, you can resubmit just that list rather than rerunning the whole batch. This is the bridge from this knowledge point into batch failure handling and retry, where the partition-and-resubmit pattern is developed in full.

It is also why correlation is treated as an apply-level competency rather than trivia. The id you assign at submission time determines how cleanly you can read results, how precisely you can recover from failures, and whether your data stays trustworthy. A sloppy id scheme makes every downstream step harder; a thoughtful one makes them nearly automatic.

What goes wrong when you skip correlation

It is worth dwelling on the failure mode, because its danger lies in how quietly it strikes. A pipeline that matches results to inputs by position does not crash, throw an error, or leave any gap. Every input receives a result, every record is populated, and every dashboard turns green. The damage is entirely in the pairing: summaries attached to the wrong documents, classifications filed against the wrong customers, extracted figures written to the wrong rows. Nothing alerts you, because from the system's point of view nothing failed. The corruption can sit undetected until someone downstream notices that a contract's summary describes a different contract entirely, by which point the bad data may have spread through reports and decisions.

That silent quality is what makes correlation a correctness requirement rather than a nicety. Most bugs announce themselves; this one hides inside plausible-looking output. The only defence is to never rely on order in the first place, and to treat the echoed identifier as the single source of truth for which result belongs to which input. An architect who has seen this failure once never matches by position again, and the exam expects you to recognise the trap without having to be burned by it.

Composing ids that stay unique and meaningful

As workloads grow, a flat identifier scheme starts to strain, and a little structure pays off. A common pattern is to namespace the identifier so it stays unique even when several jobs or stages share a results store: a date or run prefix, then the record key, then an optional stage suffix when one source record fans out into several requests. A document split into three chunks might carry ids that pair the document key with a part number, so each chunk is independently addressable yet still obviously belongs to the same source. The only hard rules are that the whole string stays within the format, letters, numbers, hyphens, and underscores up to 64 characters, and that it remains unique within the batch.

Designing ids this way also makes reruns clean. Because the identifier is derived from stable source keys rather than from submission position, a resubmitted request can carry the very same id it had the first time, so its eventual result still lands on the right record without any special handling. That idempotent quality, where the same input always maps to the same identifier, is what lets correlation and recovery compose so smoothly into the larger pipeline, and it is why thoughtful id design repays the small effort it costs up front.

The batch ID and the custom_id do different jobs

When you correlate at scale it helps to keep two identifiers straight, because they answer different questions. The batch itself has a job-level identifier, returned when you create it, that names the whole submission: you use it to poll status, list a batch, fetch its results file, or cancel it. The custom_id lives one level down, on each request inside that batch, and names a single item. The job ID tells you which run you are looking at; the custom_id tells you which record a given result belongs to.

A robust pipeline persists both alongside its source data. Storing the job ID lets you find and re-fetch the correct results file later, even across many concurrent runs, while the custom_id lets you join each line of that file back to the exact input. Confusing the two, or storing only the job ID, leaves you able to locate a batch but unable to say which document any particular answer describes, which is precisely the alignment problem correlation exists to prevent.

How this knowledge point shows up on the exam

Questions on correlation usually describe a batch whose results are being mishandled and ask you to spot the flaw or choose the fix. The wrong answers lean on positional matching ("read the results in order") or on reprocessing everything when a few items fail. The right answers always route through custom_id: use it to join results to inputs, and use it again to isolate failures. If a scenario hints that results "came back in a different order," that is the examiner signalling this exact knowledge point, and the correct response is to correlate on the id rather than the index.

Misconception

Batch results come back in the same order I submitted them, so I can match them to my inputs by position.

What's actually true

Batch results can be returned in any order, because each request finishes independently. Matching by position will misalign outputs and inputs. Always join on the custom_id that the API echoes back on each result.

Misconception

A random UUID is the best custom_id because it guarantees uniqueness.

What's actually true

A UUID is unique but meaningless, forcing you to keep a separate mapping from id to record. Encoding a stable business key, such as a document or row id, makes the result file join directly back to your data with no extra lookup, while still being unique.

Check your understanding

An engineer submits 10,000 product records to a batch to generate descriptions, assigning custom_id values of 1, 2, 3 and so on in submission order. They plan to read the results file top to bottom and apply each line to the next product. Why is this design unsafe, and what should change?

Watch and learn

Official Anthropic Academy lessons first, then hand-picked walkthroughs. Videos load only when you press play.

No videos curated for this concept yet

We are still curating the best official and community videos for this topic.

References & primary sources

Adaptive study

Master this concept with Archie

Practice it inside an adaptive study session. Archie, your Socratic AI tutor, tracks your mastery with Bayesian Knowledge Tracing and schedules the perfect next review.

Start studying

Custom ID for Batch Correlation in the Claude Batches API