Harbor x TensorLake: Infrastructure for Agentic Evals

We are thrilled to announce TensorLake as a first-class environment provider in Harbor. This integration unlocks a new tier of scalability for agent evaluation, letting developers run thousands of concurrent benchmarks in secure, ephemeral MicroVMs designed for the next generation of AI workloads. The integration is currently under review in Pull Request #1237.
By combining Harbor's rigorous evaluation framework with TensorLake's high-performance infrastructure, we are defining the standard for reliable, scalable, and secure agent benchmarking.
What is TensorLake?
TensorLake is specialized compute infrastructure for AI agents. It provides stateful sandbox infrastructure with dynamic capabilities that make it easy to deploy agents and build RL environments:
- MicroVM Isolation: Firecracker VMs with sub-200ms startup times.
- Stateful Suspend and Resume: Sandboxes are automatically suspended when they finish, and can be resumed later to re-use the VM for debugging or to start another task.
- Clone: A running sandbox can be cloned across the cluster to replicate an environment after it has been set up.
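Conceptually, the lifecycle above behaves like a small state machine. The sketch below is a toy model of it in plain Python, not the TensorLake SDK; the class and method names are illustrative only, and "state" here is just a dict standing in for full VM state:

```python
import copy

class ToySandbox:
    """Toy model of a suspend/resume/clone sandbox lifecycle.

    Illustrative only -- not the TensorLake SDK. Real sandboxes
    snapshot full VM state; here "state" is just a dict.
    """
    def __init__(self, state=None):
        self.state = state if state is not None else {}
        self.status = "running"

    def run(self, key, value):
        # Tasks can only execute while the sandbox is running.
        assert self.status == "running", "resume the sandbox first"
        self.state[key] = value

    def suspend(self):
        # Suspended automatically when a task finishes; state is preserved.
        self.status = "suspended"

    def resume(self):
        # Resume later to debug or start another task in the same VM.
        self.status = "running"

    def clone(self):
        # A running sandbox can be cloned to replicate its environment;
        # the clone gets an independent copy of the state.
        return ToySandbox(copy.deepcopy(self.state))

sb = ToySandbox()
sb.run("repo", "checked-out")
sb.suspend()
sb.resume()                   # state survives suspend/resume
replica = sb.clone()          # replica starts from the same environment
replica.run("task", "eval-2") # ...and diverges independently
print(sb.state, replica.state)
```

The key property the toy captures is that a clone shares the setup work already done (the copied state) but its subsequent writes never leak back into the original sandbox.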
Key Integration Features
1. Drop-in Scalability
Scale from 1 to 1,000 concurrent agents instantly. Switching to TensorLake in Harbor is as simple as changing a CLI flag.
harbor run --task-name [my-benchmark] --dataset [my-dataset] --env tensorlake
2. MicroVM Security
TensorLake uses MicroVMs to ensure that code executed by agents is completely isolated from your host infrastructure. This is critical when evaluating agents on untrusted code or complex benchmarks where "rm -rf /" might be a valid (but dangerous) agent action.
3. Resource Control & GPU Support
The integration supports fine-grained control over the sandbox resources directly from your Harbor config:
- Compute: Configurable vCPUs and RAM.
- Storage: Ephemeral disk sizing.
- GPUs: Native support for GPU-accelerated workloads, essential for agents performing local inference or data science tasks.
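As a sketch, the resource knobs above might appear in a Harbor config like the fragment below. The field names are illustrative assumptions, not the actual schema; consult the Harbor documentation for the real option names:

```yaml
# Hypothetical Harbor environment config -- field names are illustrative.
environment:
  type: tensorlake
  resources:
    vcpus: 4          # configurable vCPUs
    memory_gb: 16     # RAM
    disk_gb: 50       # ephemeral disk sizing
    gpu: nvidia-a10g  # optional GPU for local inference or data science tasks
```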
4. State Management with Snapshots
Harbor leverages TensorLake's snapshot capabilities. You can start evaluations from pre-warmed states, significantly reducing setup time for complex environments that require heavy dependency installation.
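The payoff of pre-warmed states is easy to see with a toy model: pay the expensive setup once, snapshot the result, then restore cheaply per evaluation. The sketch below is conceptual only (plain Python, not the TensorLake API; a "snapshot" here is just a deep copy of a dict):

```python
import copy
import time

def heavy_setup():
    # Stands in for dependency installation in a complex environment.
    time.sleep(0.2)
    return {"deps_installed": True, "cache": list(range(1000))}

# Cold start: pay the setup cost for every evaluation.
t0 = time.perf_counter()
cold_env = heavy_setup()
cold = time.perf_counter() - t0

# Pre-warmed: snapshot once after setup, then restore cheaply each time.
snapshot = copy.deepcopy(cold_env)
t0 = time.perf_counter()
warm_env = copy.deepcopy(snapshot)
warm = time.perf_counter() - t0

assert warm_env == cold_env  # restored state matches the freshly built one
print(f"cold start: {cold:.3f}s, restore from snapshot: {warm:.3f}s")
```

In the real integration the snapshot captures full VM state rather than a Python object, but the trade-off is the same: setup cost is paid once instead of once per run.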
TensorLake vs. Other Environments
Why choose TensorLake?
- Vs. Daytona: While Daytona is excellent for persistent developer environments (long-running workspaces), TensorLake is optimized for the high-churn, ephemeral nature of agent loops where environments are created and destroyed rapidly.
- Vs. E2B: Both offer excellent MicroVM sandboxing. TensorLake is particularly distinct in its broader ecosystem integration (Indexify) for extraction and workflow orchestration, making it a strong choice if your agents are part of a larger data processing pipeline.
- Vs. Modal: Modal excels at serverless GPU compute and batch ML jobs. TensorLake is optimized for stateful, long-running agent loops, with native suspend/resume, live migration, and cloning that Modal doesn't support. If your agents need to persist state across requests rather than run as isolated jobs, TensorLake is the better fit.
Getting Started
1. Install the SDK:
pip install tensorlake
2. Set your API key:
export TENSORLAKE_API_KEY="tl_..."
3. Run your first task (make sure your model provider API keys are configured):
harbor run --env tensorlake --task-name adaptive-rejection-sampler --dataset terminal-bench@2.0 --agent claude-code --model anthropic/claude-sonnet-4-6
Debugging
Need to see what the agent is doing inside the sandbox? Harbor exposes TensorLake's native debugging tools:
# Drops you directly into the running sandbox shell
harbor env attach <session_id>