Download and Upload Evidence
When you run an evaluation in AI GO!, the platform generates evidence, i.e. a task result log — a structured record of every inference call and computed score.
- Download a task result log to inspect evidence locally, feed results into analysis pipelines, integrate with third-party tools, or transfer results to another AI app.
- Upload a task result log to reproduce results in another AI app, reuse a cached run without rerunning inference, or save time and cost.
Task Result Log
A task result log is a JSON file produced after a task specification finishes running. It contains the raw inputs, model responses, and computed scores for every sample in the dataset. Because it captures the complete output of a task run, it can be re-uploaded to a different AI app or deployment to reproduce the same results without any additional inference.
Download Task Result Log
Downloading a task result log exports the inference outputs and scores from a completed evaluation run to disk, along with snapshots of all entities used in the evaluation.
Step 1: Setup (Optional)
Run the evaluation that you want to download the evidence for.
lf run -f run.yamlStep 2: Export Results
lf export eval serializes an evaluation into a portable YAML file.
--outputsets the destination directory.--link-logsdownloads the task result log for each task specification and saves it alongside the YAML — the exported file then includestask_result_log_pathentries pointing to those files.
lf export eval --id <eval ID> --output results --link-logsThe resulting directory looks like this:
results/
├── foobarization-evaluation.yaml
└── task_results/
└── foobarization-spec/
└── task_result_log.jsonThe exported foobarization-evaluation.yaml contains a task_result_log_path for each task specification:
evaluation:
key: foobarization-evaluation
task_specifications:
- key: foobarization-spec
task_result_log_path: task_results/foobarization-spec/task_result_log.jsonUpload Task Result Log
Uploading a task result log injects pre-computed inference outputs and scores into a new evaluation run.
Step 1: Setup (Optional)
A pre-condition to uploading pre-computed task result logs is that all entities used in the evaluation that produced the log are integrated into the new AI app. You can use the lf add to integrate the entities, but not run any evaluation.
lf add -f run.yamlStep 2: Run with Uploaded Log
Run the evaluation using the exported results. Because the exported evaluation YAML contains task_result_log_path entries, AI GO! uploads the logs instead of executing inference. The new evaluation will show the same results as the source evaluation.
lf run -f results/foobarization-evaluation.yamlFind a full code example for this guide in AI GO! integrations.
