Darkbloom
Research Preview - Private inference on verified Apple Silicon

Cost-efficient private AI inference

Darkbloom routes encrypted requests to hardware-verified Apple Silicon providers, delivering comparable model performance at about 50% lower cost than typical API providers. Prompts stay hidden from operators, and Mac owners earn from compute they already own.

Your app Darkbloom Mac Studio verified Verified Macs serve inference MacBook verified Encrypted 50% lower cost Private encrypted request in - private result out
01 - What You Get
For developers

Private inference without a new SDK

Change the base URL and keep your existing OpenAI client. Requests are encrypted before they leave your app and routed to verified Apple Silicon providers.

Open Console ↗
For Mac owners

Turn idle Apple Silicon into earnings

Run a provider on hardware you already own. Darkbloom matches your Mac with inference demand, and operators keep 100% of inference revenue during the research preview.

Start Earning ↗
02 - Why It Costs Less
Most inference pricing includes several layers between silicon and the developer.

Capacity is bought, rented, repackaged, and metered before it reaches an API call. Each layer adds margin. Darkbloom routes demand to idle Apple Silicon instead, where the hardware is already paid for and the marginal cost is mostly electricity.
Typical API supply chain NVIDIA AWS Google Cloud Azure CoreWeave API providers End users
Apple has shipped over 100 million machines with serious ML hardware: unified memory, high bandwidth, Neural Engines, and enough RAM in high-end systems to serve large MoE models. Most of that capacity sits idle for long stretches every day.

Darkbloom turns that idle capacity into a private inference market.

Developers get lower prices without changing SDKs. Mac owners earn from machines they already own. The coordinator matches demand to providers, but prompts stay encrypted and hidden from the operator.
100M+
Apple Silicon machines shipped since 2020
50%
lower cost at comparable model performance
18hrs
average daily idle time per machine
100%
of inference revenue goes to the hardware owner
03 - The Privacy Problem
Routing to idle machines is only useful if the operator cannot read the request.

Prompts can contain customer conversations, internal plans, source code, and other sensitive context. A marketplace promise is not enough when inference runs on hardware you do not own.

Darkbloom is designed around a stricter guarantee: the coordinator can route requests, the provider can serve them, but neither should get a usable view of the prompt.

Private inference requires privacy that can be verified, not just promised.
04 - Privacy Architecture

Operator-blind by design

Darkbloom removes the practical software paths an operator could use to observe inference data. Four layers work together, each independently verifiable.

Encryption

Encrypted end-to-end

Requests are encrypted before transmission. The coordinator routes ciphertext, and only the matched provider's hardware-bound key can decrypt the request.

Hardware

Hardware-verified

Each provider uses a key generated inside Apple's tamper-resistant secure hardware. The attestation chain traces back to Apple's root certificate authority.

Runtime

Hardened runtime

The inference process is locked down at the OS level. Debugger attachment and memory inspection are blocked so the operator cannot inspect a running request.

Output

Traceable to hardware

Responses are signed by the specific machine that produced them. The attestation chain is public, so users can verify the hardware behind the result.

E2E Encryption encrypted before it leaves your device OS Integrity SIP enforced · signed system volume · binary self-hash Memory Isolation Hypervisor.framework · Stage 2 page tables Hardened Process debugger blocked · no shell access Your inference data prompts · responses · model state ↑ operator is here — every path inward is eliminated

The operator contributes compute, not visibility.

Your prompt is encrypted before it leaves your app. The coordinator routes traffic it cannot read. The provider serves the request inside a hardened process the operator cannot inspect.

Read the paper ↗
05 - Developer Experience

OpenAI-compatible API

Keep your SDK, request shape, and streaming code. Point the client at Darkbloom and start routing private inference.

python
from openai import OpenAI

client = OpenAI(
    base_url="https://api.darkbloom.dev/v1",
    api_key="your-api-key"
)

response = client.chat.completions.create(
    model="mlx-community/gemma-4-26b-a4b-it-8bit",
    messages=[{"role": "user", "content": "Hello!"}],
    stream=True
)

for chunk in response:
    print(chunk.choices[0].delta.content, end="")
Streaming - SSE in the OpenAI format
Large MoE - selected models up to 239B params
06 - Pricing

50% lower cost, comparable performance

Idle Apple Silicon keeps the cost structure simple. Pay per token with no subscription or minimum, with selected model prices set around 50% below typical API-provider rates for comparable models.

ModelInputOutputTypical APIvs typical API
Gemma 4 26B4B active, fast multimodal MoE$0.03$0.20$0.4050% lower
Qwen3.5 27BDense, frontier reasoning$0.10$0.78$1.5650% lower
Qwen3.5 122B MoE10B active, best quality$0.13$1.04$2.0850% lower
MiniMax M2.5 239B11B active, SOTA coding$0.06$0.50$1.0050% lower

Prices per million tokens. Typical API means published list rates for comparable models from major API providers.

07 - Earn

Earn from your Mac

Install the provider, choose when your Mac is available, and earn from inference jobs matched by the network. During the research preview, operators keep 100% of inference revenue.

100%
of inference revenue goes to you
Low
marginal cost on Apple Silicon

Install via Terminal

Downloads the provider binary and configures a background launchd service.

terminal
$ curl -fsSL https://api.darkbloom.dev/install.sh | bash
No dependenciesAuto-updatesRuns as launchd service

Earnings estimate

Select your hardware, model, active hours, and electricity cost to estimate provider earnings.

Auto-selected: most profitable for your hardware

$ /kWh

US avg: $0.15 · EU avg: $0.25 · CA avg: $0.22

Estimates only. Actual earnings depend on demand, model popularity, provider reputation, uptime, and local electricity cost.

Read the technical paper

Architecture, threat model, security analysis, and economic model for private inference on distributed Apple Silicon.

Download PDF ↗