Overview
Artifacts are the data outputs generated by workflow runs. When workflows execute, they produce various types of artifacts such as extracted data (JSON, CSV), screenshots, documents (PDF), logs, and other files that represent the results of the data operation.
Artifacts are the primary means of retrieving the actual data collected during workflow execution. They provide structured access to the information extracted from source systems, along with supporting files like screenshots and logs that help audit and debug the extraction process.
Key Concepts
Artifact Properties
Each artifact includes:
- Type: The category or nature of the artifact (e.g., extracted_data, screenshot, log_file)
- Display Name: A human-readable name for the artifact
- MIME Type: The file type (e.g., application/json, image/png, application/pdf, text/csv)
- File Size: Size of the artifact in bytes
- File Hash: A hash of the file content for integrity verification
- Storage Path: Internal reference to where the artifact is stored
- Signed URL: A temporary, secure URL for downloading the artifact content
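As a rough illustration, the properties above could be modeled as a small typed record. This is a minimal sketch, not the documented schema: the class name `Artifact`, the method `from_payload`, and every JSON key other than `signedUrl` are assumptions made for the example.

```python
from dataclasses import dataclass
from typing import Any, Mapping


@dataclass(frozen=True)
class Artifact:
    """Illustrative artifact record mirroring the properties listed above."""

    type: str            # e.g. "extracted_data", "screenshot", "log_file"
    display_name: str
    mime_type: str       # e.g. "application/json"
    file_size: int       # size in bytes
    file_hash: str
    storage_path: str
    signed_url: str

    @classmethod
    def from_payload(cls, payload: Mapping[str, Any]) -> "Artifact":
        # Assumed camelCase keys; only "signedUrl" is named on this page.
        return cls(
            type=payload["type"],
            display_name=payload["displayName"],
            mime_type=payload["mimeType"],
            file_size=int(payload["fileSize"]),
            file_hash=payload["fileHash"],
            storage_path=payload["storagePath"],
            signed_url=payload["signedUrl"],
        )
```

Parsing into a frozen record like this makes a missing field fail loudly at the boundary instead of deep inside application code.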
Artifact Association
Artifacts are linked to:
- Workflow ID: The workflow that generated the artifact
- Workflow Run ID: The specific workflow run that created it
These links let you:
- Find all artifacts ever generated by a workflow
- Retrieve artifacts from a specific workflow execution
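The two lookups above can be sketched as a client-side filter over artifact records that have already been fetched. The function name `filter_artifacts` and the keys `workflowId` and `workflowRunId` are hypothetical; the actual field names may differ.

```python
from typing import Any, Dict, Iterable, List, Optional


def filter_artifacts(
    artifacts: Iterable[Dict[str, Any]],
    workflow_id: Optional[str] = None,
    workflow_run_id: Optional[str] = None,
) -> List[Dict[str, Any]]:
    """Keep the artifacts that match every ID that was supplied."""
    selected = []
    for artifact in artifacts:
        # "workflowId" and "workflowRunId" are assumed key names.
        if workflow_id is not None and artifact.get("workflowId") != workflow_id:
            continue
        if workflow_run_id is not None and artifact.get("workflowRunId") != workflow_run_id:
            continue
        selected.append(artifact)
    return selected
```

If the List Artifacts operation accepts filter parameters, filtering on the server would likely be preferable to this approach, since it avoids transferring records you do not need.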
Signed URLs
The signedUrl field provides secure, time-limited access to artifact content:
- Valid for a limited time (typically 15 minutes)
- Requires no additional authentication once generated
- Can be shared or used directly in applications
- Expires automatically for security
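One way to work with the limited lifetime is to record when a signed URL was obtained and treat it as stale shortly before the expected expiry. The sketch below assumes the 15-minute lifetime mentioned above and an arbitrary 60-second safety margin; `TimedSignedUrl` and both constants are names invented for this example.

```python
import time
from dataclasses import dataclass, field
from typing import Optional

SIGNED_URL_TTL_SECONDS = 15 * 60   # "typically 15 minutes" per the list above
SAFETY_MARGIN_SECONDS = 60         # assumed buffer, tune for your application


@dataclass
class TimedSignedUrl:
    """Pairs a signed URL with the moment it was obtained."""

    url: str
    fetched_at: float = field(default_factory=time.monotonic)

    def is_probably_expired(self, now: Optional[float] = None) -> bool:
        # Compare elapsed time against the lifetime minus the safety margin.
        current = time.monotonic() if now is None else now
        elapsed = current - self.fetched_at
        return elapsed >= SIGNED_URL_TTL_SECONDS - SAFETY_MARGIN_SECONDS
```

Because the lifetime is only described as typical, a check like this is a hint rather than a guarantee; the download itself can still fail and should be handled.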
Artifact Types
Common artifact types include:
Extracted Data
- JSON files containing structured data extracted from source systems
- CSV files with tabular data
- XML or other structured formats
- Screenshots of key pages or steps during workflow execution
- PDF documents retrieved from source systems
- Images captured during the process
- Employee records downloaded from HR systems
- Financial reports from accounting systems
- Tax forms and compliance documents
API Operations
List Artifacts
Retrieve artifacts for your organization with pagination support
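A paginated listing is usually consumed with a loop that keeps requesting pages until none remain. The sketch below assumes a cursor-based scheme in which each page is a dictionary with `items` and `nextCursor` keys; the pagination style, both key names, and the `fetch_page` callable are assumptions, so adapt them to the actual List Artifacts response.

```python
from typing import Any, Callable, Dict, Iterator, Optional

Page = Dict[str, Any]


def iter_artifacts(
    fetch_page: Callable[[Optional[str]], Page],
) -> Iterator[Dict[str, Any]]:
    """Yield artifacts one by one across all pages.

    fetch_page is a hypothetical wrapper around the List Artifacts call:
    it takes a cursor (None for the first page) and returns one page.
    """
    cursor: Optional[str] = None
    while True:
        page = fetch_page(cursor)
        # "items" and "nextCursor" are assumed response keys.
        yield from page.get("items", [])
        cursor = page.get("nextCursor")
        if not cursor:
            break
```

Writing the loop as a generator lets callers stop early without fetching the remaining pages.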
Get an Artifact
Retrieve detailed information about a specific artifact, including a fresh signed URL
Once you have the signed URL, you can download the artifact content directly. The URL expires after 15 minutes, so fetch a new one if needed.
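Downloading through a signed URL can be sketched with the Python standard library alone, since no extra authentication is needed once the URL exists. The integrity check below assumes the file hash is a hex-encoded SHA-256 digest; the page does not name the algorithm, so treat that as a guess. The function names are illustrative.

```python
import hashlib
import urllib.request


def download_artifact(signed_url: str, timeout: float = 30.0) -> bytes:
    """Fetch the artifact content; no auth headers are added."""
    with urllib.request.urlopen(signed_url, timeout=timeout) as response:
        return response.read()


def matches_hash(content: bytes, expected_hex: str) -> bool:
    # Assumption: the artifact's file hash is a hex SHA-256 digest.
    actual = hashlib.sha256(content).hexdigest()
    return actual == expected_hex.strip().lower()
```

A caller would typically download the bytes, compare them against the artifact's file hash, and discard the content if the check fails.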
Relationship with Other Resources
- Workflows: The template that defines artifact generation
- Workflow Runs: The execution instance that created the artifact
- Job Runs: The overall execution that triggered the workflow run
Best Practices
Artifact Retrieval
- Fetch When Needed: Request artifacts only when you need to access their content
- Handle Expiration: If your application displays signed URLs, implement logic to refresh them when they expire
- Filter Efficiently: Use specific queries to retrieve only needed artifacts
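The expiration handling described above might look like the following retry wrapper, which requests a fresh signed URL whenever a download is rejected. It assumes an expired URL produces an HTTP 401 or 403 response, which is common for signed URLs but not confirmed here. `get_signed_url` and `download` are hypothetical callables you would supply, for example thin wrappers around the Get Artifact operation and an HTTP client.

```python
import urllib.error
from typing import Callable


def download_with_refresh(
    artifact_id: str,
    get_signed_url: Callable[[str], str],
    download: Callable[[str], bytes],
    max_attempts: int = 2,
) -> bytes:
    """Download an artifact, refreshing the signed URL if it was rejected."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    last_error = None
    for _ in range(max_attempts):
        # Each attempt asks for a fresh URL (hypothetical Get Artifact wrapper).
        url = get_signed_url(artifact_id)
        try:
            return download(url)
        except urllib.error.HTTPError as err:
            # Assumption: expired signed URLs surface as 401 or 403.
            if err.code not in (401, 403):
                raise
            last_error = err
    raise last_error
```

Passing the two operations in as callables keeps the retry logic independent of any particular HTTP client and makes it easy to exercise with fakes.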
Security
- Protect URLs: Don’t expose signed URLs publicly or log them
- Time Awareness: Remember that signed URLs expire after 15 minutes
- Verify Sources: Confirm artifacts come from expected workflow runs
- Access Control: Implement proper access controls for artifact data
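Two of the practices above lend themselves to small helpers: redacting signed URLs before they reach logs, and checking that an artifact belongs to a run you expected. This is a sketch under two assumptions: that the signature is carried in the URL's query string, and that the run identifier is exposed under a key named `workflowRunId`. Both function names are illustrative.

```python
from typing import Any, Collection, Mapping
from urllib.parse import urlsplit, urlunsplit


def redact_signed_url(url: str) -> str:
    """Return a log-safe form of the URL with query and fragment removed."""
    parts = urlsplit(url)
    # Assumption: the signature lives in the query string.
    query = "REDACTED" if parts.query else ""
    return urlunsplit((parts.scheme, parts.netloc, parts.path, query, ""))


def is_from_expected_run(
    artifact: Mapping[str, Any],
    expected_run_ids: Collection[str],
) -> bool:
    # "workflowRunId" is an assumed key name.
    return artifact.get("workflowRunId") in expected_run_ids
```

For example, logging `redact_signed_url(artifact_url)` instead of the raw value keeps the host and path available for debugging without leaking the credential-bearing part.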