NodeGhost — Docs & Setup Guides

// getting started

Get your API key

After completing checkout, NodeGhost will automatically send your API key to the email address you used at signup. The key looks like this:

ng-xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx

Keep this key private — it controls access to your plan's request quota. If you lose it, contact monitee@gmail.com and we'll reissue one.

Free plan: No credit card required. Sign up and your key is emailed instantly. Free tier includes 200M CU per month — roughly 500 inference calls or 4,000 web searches.

Using the API

NodeGhost is a drop-in replacement for the OpenAI API. Most customers authenticate with an ng- key obtained via Stripe subscription or USDC payment; native POKT stakers use on-chain delegation instead (Path 3 below).

Path 1 — ng- key (Stripe or USDC)

If you signed up via Stripe or paid with USDC, you have an ng- API key. Register your model endpoint once, then use it like any OpenAI-compatible API:

curl https://nodeghost.ai/v1/chat/completions \
  -H "Authorization: Bearer ng-your-key-here" \
  -H "X-Endpoint-Key: sk-your-model-api-key" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "your-model-here",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

X-Endpoint-Key: Pass your model provider's API key (OpenAI, Groq, DeepSeek, etc.) in the X-Endpoint-Key header. Register your endpoint first at POST /v1/endpoint/register.

Path 2 — Python (ng- key)

from openai import OpenAI

client = OpenAI(
    api_key="ng-your-key-here",
    base_url="https://nodeghost.ai/v1",
    default_headers={"X-Endpoint-Key": "sk-your-model-api-key"}
)

response = client.chat.completions.create(
    model="your-model-here",
    messages=[{"role": "user", "content": "Hello!"}]
)

print(response.choices[0].message.content)

Path 3 — Native POKT stake

If you've staked a POKT application directly on Shannon mainnet, see the Stake your app wallet section below for the full flow — different endpoint, different headers, no ng- key required.

Supported model providers

Provider	X-Endpoint value	Example model
OpenAI	`https://api.openai.com`	`gpt-4o-mini`
DeepSeek	`https://api.deepseek.com`	`deepseek-chat`
Groq	`https://api.groq.com/openai`	`llama-3.1-70b-versatile`
Anthropic	`https://api.anthropic.com`	`claude-3-5-haiku-20241022`
Self-hosted (Ollama etc.)	`https://your-server.com`	`llama3.2:3b`

Plans & CU metering

NodeGhost meters usage in Compute Units (CU). Each plan includes a monthly CU pool that resets each month. Different services consume CU at different rates — inference is the most expensive, web search and vector memory are much cheaper.

Plans

Plan	Monthly CU	API keys	Price
Free	200,000,000	2	$0/mo
Starter	9,000,000,000	5	$9/mo
Pro	29,000,000,000	20	$29/mo
Agent	99,000,000,000	Unlimited	$99/mo

CU per call

Service	Endpoint	CU per call
Inference	`/v1/chat/completions`	400,000
Web search	`/v1/tools/search`	50,000
Vector memory	`/v1/memory/*`	25,000

FREE 200M CU ≈ 500 INFERENCE CALLS · STARTER 9B ≈ 22,500 · PRO 29B ≈ 72,500 · AGENT 99B ≈ 247,500

Mixed usage: Because tools and memory cost a fraction of inference, agents that combine search and memory with inference fit many more total calls. A turn that uses 1 web search (50K) + 1 memory recall (25K) + 1 inference (400K) totals 475K CU.

// home assistant

Home Assistant overview

NodeGhost works as the AI brain for Home Assistant — giving you a genuinely intelligent voice assistant that runs privately, without sending your conversations to Google, Amazon, or Apple.

The full private stack looks like this:

1

Local speech-to-text (Whisper)

Converts your voice to text entirely on your device. Nothing leaves your home at this stage.

2

NodeGhost AI (via POKT)

Your text is routed through the POKT decentralized network to your registered model endpoint. No logging of request content — ever.

3

Local text-to-speech (Piper)

The AI response is converted back to voice on your device. Your assistant speaks back to you.

Total cost: $9/month on the Starter plan. Smarter than Alexa, more private than everything.

Install the integration

NodeGhost uses the Extended OpenAI Conversation integration available through HACS (Home Assistant Community Store).

Step 1 — Install HACS

If you don't have HACS installed yet, follow the official guide at hacs.xyz. It adds a community add-on store to your Home Assistant instance.

Step 2 — Install Extended OpenAI Conversation

1

Open HACS in Home Assistant

Go to HACS → Integrations → search for "Extended OpenAI Conversation" → Download.

2

Restart Home Assistant

After downloading, restart Home Assistant to load the new integration.

3

Add the integration

Go to Settings → Devices & Services → Add Integration → search for "Extended OpenAI Conversation".

4

Configure NodeGhost credentials

When prompted, enter the following:

API Key:  ng-your-key-here
Base URL: https://nodeghost.ai/v1
Model:    your-model-here

The model name must match your registered endpoint. Pass your model provider's API key in the X-Endpoint-Key header if your integration supports custom headers — otherwise register your endpoint via POST /v1/endpoint/register first.

5

Set as your conversation agent

Go to Settings → Voice Assistants → select your assistant → set Conversation Agent to "Extended OpenAI Conversation".

Tip: Once configured, you can talk to your assistant by saying "Hey Jarvis" (or whatever wake word you've set) and it will use NodeGhost for the AI response.

Add local voice (Whisper + Piper)

For fully private voice — no cloud at any step — install the local speech add-ons. These run entirely on your Home Assistant hardware.

Install Whisper (speech to text)

1

Go to Settings → Add-ons → Add-on Store

Search for "Whisper" — install the official "Whisper" add-on by Home Assistant.

2

Start the add-on and enable on boot

In the add-on settings, toggle "Start on boot" and "Watchdog" then hit Start.

3

Add the Wyoming integration

Go to Settings → Devices & Services → Add Integration → search "Wyoming Protocol" → configure it pointing to the Whisper add-on.

Install Piper (text to speech)

Repeat the same steps for the "Piper" add-on. Once both are installed, go to Settings → Voice Assistants and set:

Speech-to-text:  Whisper
Text-to-speech:  Piper
Conversation:    Extended OpenAI Conversation (NodeGhost)

Hardware note: Whisper runs best on a Raspberry Pi 4 or 5 with at least 4GB RAM. On older hardware, use the "tiny" model for faster response times.

Remote access

To use the Home Assistant app and your NodeGhost voice assistant when you're away from home, you need a way to reach your Pi remotely. The free option is a Cloudflare Tunnel.

Option A — Cloudflare Tunnel (free)

1

Get a free domain or use one you own

Cloudflare Tunnels require a domain managed by Cloudflare. You can transfer an existing domain or register one at cloudflare.com.

2

Install the Cloudflare Tunnel add-on in HA

In HACS, search for "Cloudflare Tunnel" and install it. Configure it with your Cloudflare token from the Cloudflare Zero Trust dashboard.

3

Point the HA app at your tunnel URL

In the Home Assistant app settings, set your external URL to your Cloudflare tunnel address (e.g. https://home.yourdomain.com).

Option B — Nabu Casa ($6.50/month)

Nabu Casa provides an easy one-click remote access tunnel. It works alongside NodeGhost — Nabu Casa handles the network tunnel, NodeGhost handles the AI. Go to Settings → Home Assistant Cloud to subscribe.

Note: Either remote access option works with NodeGhost. Your AI requests always route through NodeGhost regardless of how you connect remotely.

// bring your own model

Run your own AI model through NodeGhost

NodeGhost is a privacy-preserving inference gateway — not a managed model service. Register your own OpenAI-compatible endpoint and route all inference through NodeGhost's infrastructure, giving you complete control over which AI model processes your data while NodeGhost handles auth, rate limiting, and decentralized routing.

This means you can run an open source model like Llama, Mistral, or Qwen on your own server, point NodeGhost at it, and use the same https://nodeghost.ai/v1 endpoint you already know. Your model does the inference. NodeGhost handles everything else.

Available on all plans. Custom endpoint registration is included at every tier — from Free to Business. You bring the model, NodeGhost brings the infrastructure layer.

Why run your own model?

There are several reasons you might want to bring your own endpoint:

You've fine-tuned a model on proprietary data and need inference for it
You want complete privacy — no third party sees your prompts at any layer
You want to use a specific open source model not available through NodeGhost's default backend
You're building a product on top of your own model and need auth and rate limiting infrastructure
You want to protect your model endpoint from being exposed publicly

How it works

When you register a custom endpoint, NodeGhost stores it linked to your API key. Every inference request you make is authenticated and rate limited as normal — then forwarded to your endpoint via the POKT network. Your model processes the request and returns the response through NodeGhost back to your application.

Your endpoint is never exposed publicly. All traffic enters through nodeghost.ai/v1 and NodeGhost proxies it to your server privately.

Register your endpoint

Your endpoint must be publicly accessible over HTTPS and OpenAI-compatible. Any server running Ollama, LM Studio, vLLM, or a custom FastAPI wrapper will work.

Step 1 — Run an OpenAI-compatible model server

The most common option is Ollama. Install it on any VPS or server and expose it publicly:

# Install Ollama on your server
curl -fsSL https://ollama.com/install.sh | sh

# Pull a model
ollama pull llama3.2

# Start with HTTPS via nginx reverse proxy
# Your endpoint will be: https://your-server.com/v1/chat/completions

HTTPS required. NodeGhost only accepts endpoints served over HTTPS. Use a reverse proxy like nginx with a Let's Encrypt certificate to secure your Ollama server.

Step 2 — Register your endpoint with NodeGhost

Call the registration endpoint with your ng- API key:

curl -X POST https://nodeghost.ai/v1/endpoint/register \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_NG_KEY" \
  -d '{
    "endpoint_url": "https://your-server.com",
    "endpoint_name": "My Llama Server"
  }'

NodeGhost will verify your endpoint is reachable and return a confirmation:

{
  "success": true,
  "endpoint_url": "https://your-server.com",
  "endpoint_name": "My Llama Server",
  "message": "Custom endpoint registered. Your inference calls will now route to this endpoint via the POKT network."
}

Step 3 — Use NodeGhost as normal

Nothing changes in your application. Keep using https://nodeghost.ai/v1 with your ng- key. NodeGhost silently routes to your registered model endpoint:

from openai import OpenAI

client = OpenAI(
    base_url="https://nodeghost.ai/v1",
    api_key="YOUR_NG_KEY"
)

# Routed through POKT to your registered endpoint
response = client.chat.completions.create(
    model="llama3.2",  # match your model name
    messages=[{"role": "user", "content": "Hello"}]
)

Check your routing status

curl https://nodeghost.ai/v1/endpoint/register \
  -H "Authorization: Bearer YOUR_NG_KEY"

Remove your custom endpoint

To remove your registered endpoint:

curl -X DELETE https://nodeghost.ai/v1/endpoint/register \
  -H "Authorization: Bearer YOUR_NG_KEY"

Full privacy stack — zero third parties

If complete privacy is your goal — where no third party ever sees the content of your AI requests — this is the architecture that achieves it.

The goal: Your prompts leave your device, get authenticated through NodeGhost, and land on your own server running your own model. Nobody else is in the chain.

The full stack

Here's every component and who controls it:

Your application

Home Assistant, custom app, anything OpenAI-compatible

You control

NodeGhost gateway — powered by POKT

Auth, rate limiting, and decentralized routing via POKT Shannon mainnet — never logs content

NodeGhost

Your VPS running Ollama

Any provider — Hetzner, DigitalOcean, Vultr, etc.

You control

Your open source model

Llama, Mistral, Qwen, Phi — weights downloaded once, run locally

You control

What this means for privacy

With this setup your prompts travel from your application to NodeGhost for authentication and routing through the POKT decentralized network, then directly to your own server. The request enters through a gateway with no single point of control — POKT's on-chain verification ensures the routing is transparent and auditable. The model that processes your request runs on hardware you control. No AI company, no cloud provider, no GPU farm ever sees what you're asking.

NodeGhost sees that a request happened — the timestamp and your API key — but never the content. That metadata is used only for rate limiting and is retained for 90 days.

Recommended models for self-hosting

These open source models run well on a modest VPS with 16GB+ RAM:

General purpose

Llama 3.2 8B

Fast, capable, 8GB RAM minimum

Coding & reasoning

Qwen 2.5 14B

Strong reasoning, 16GB RAM recommended

Lightweight

Phi-4 Mini

Excellent quality, runs on 4GB RAM

High performance

Mistral 7B

Fast inference, 8GB RAM minimum

VPS recommendation: A Hetzner CX32 (4 vCPU, 8GB RAM, ~€8/month) runs Llama 3.2 8B comfortably via Ollama. For larger models, a CX42 (8 vCPU, 16GB RAM) handles most 14B models well.

// crypto native

What is POKT?

POKT is the native token of Pocket Network — the decentralized infrastructure that NodeGhost is built on. Every AI request you make through NodeGhost is routed through the POKT Shannon network, verified on-chain, and settled between gateway operators and node suppliers.

This is what makes NodeGhost fundamentally different from other AI providers — the routing is on-chain and decentralized, so there's no single company that could log, monitor, or sell your requests even if they wanted to.

For most users, POKT is invisible — you just use your ng- API key and pay via Stripe. But if you're crypto-native and want to interact with the network directly, you can stake your own application wallet and pay per relay in POKT tokens.

Stake your app wallet

Advanced users can bypass the Stripe subscription entirely by staking a POKT application wallet directly on Shannon mainnet. This gives you pay-per-relay access at the rate set by each service — for example ai-inference is priced at $0.0004 per relay. Each service has its own relay price, so the cost depends on which service you stake against.

External native staking is currently supported only for the ai-inference service. Other POKT services (web-search, text-generation, vector-memory) are operated as NodeGhost-internal infrastructure and not useful targets for external stakers.

Advanced: This requires familiarity with blockchain wallets and the POKT CLI. If you're new to crypto, the Stripe plans are much easier to get started with.

What you'll need

A funded POKT wallet with enough tokens to stake an application. Current minimum is 1,001 POKT. You'll also need pocketd installed on your machine.

Stake your application

pocketd tx application stake-application \
  --config=app-stake-config.yaml \
  --keyring-backend=test \
  --from=your-wallet-name \
  --network=main \
  --fees 2000upokt \
  --yes

Your app-stake-config.yaml should specify the ai-inference service:

stake_amount: "1001000000upokt"
service_ids:
  - ai-inference

Once staked, delegate your application to NodeGhost's gateway address:

pocketd tx application delegate-to-gateway pokt1ecrykpsr87juxcpdn2yxq8mnfrvhrs85dk5y3t \
  --from=your-wallet-name \
  --keyring-backend=test \
  --network=main \
  --fees 2000upokt \
  --yes

Required: The delegate step is mandatory — without it you will get a "gateway does not have delegation" error when making requests. Wait ~10 minutes after staking for the session to roll over before testing.

Gateway address for reference:

Gateway address: pokt1ecrykpsr87juxcpdn2yxq8mnfrvhrs85dk5y3t

Using the gateway after staking

Once staked and delegated, hit the native POKT endpoint with your model provider's API key as Authorization, your provider URL as X-Endpoint, and your application address as X-App-Address:

curl https://nodeghost.ai/pokt/v1/chat/completions \
  -H "Authorization: Bearer sk-your-model-api-key" \
  -H "X-Endpoint: https://api.openai.com" \
  -H "X-App-Address: pokt1your-staked-app-address-here" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-4o-mini",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

Supported providers via X-Endpoint: OpenAI, DeepSeek, Groq, Anthropic, or any OpenAI-compatible self-hosted model. If X-Endpoint is omitted, defaults to https://api.openai.com.

The X-App-Address header

Every request must include your application's POKT address in the X-App-Address header. This tells the gateway which of your delegated applications to claim the relay under.

If you've staked a single application, this is just your one app address on every request. If you've staked multiple applications across different services, include the address matching the service you're calling.

X-App-Address: pokt1zcutxcp2nw92m8gz4aum8e9lapgztvr7j0d9ay

The gateway validates the address (regex ^pokt1[a-z0-9]{38}$) and confirms on-chain that it delegates to the NodeGhost gateway. If missing or malformed, the request returns HTTP 400.

Need help? Join the POKT Discord at discord.gg/pokt or email us at monitee@gmail.com.

// agentic payments

USDC on Base

USDC payments via the Base network are currently available for select customers. We're building out wallet-based payment association so USDC sends from your registered wallet credit automatically — without exposing your API key on-chain.

If you'd like to use USDC today while we finalize this, email monitee@gmail.com and we'll provision your account manually.

// hosted models

Hosted models

NodeGhost offers access to hosted open source models routed through the POKT decentralized network. No external API key required — just your ng- key. These models are served by independent node operators on Shannon mainnet, earning POKT relay rewards for every request.

Privacy note: Hosted model requests are routed through POKT the same way as all NodeGhost traffic — decentralized, no logs, no surveillance. The model name is anonymized at the supplier level.

Model	Endpoint	Model name	Status
Llama 3.2 1B Instruct	`/text-generation/v1/`	`pocket_network`	Live

Text generation

Access Llama 3.2 1B Instruct via the POKT decentralized network. No model API key needed — requests are handled by independent POKT node operators. Use pocket_network as the model name.

Example request (curl)

curl https://nodeghost.ai/text-generation/v1/chat/completions \
  -H "Authorization: Bearer ng-your-key-here" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "pocket_network",
    "messages": [{"role": "user", "content": "Hello!"}],
    "max_tokens": 200
  }'

Example request (Python)

from openai import OpenAI

client = OpenAI(
    api_key="ng-your-key-here",
    base_url="https://nodeghost.ai/text-generation/v1"
)

response = client.chat.completions.create(
    model="pocket_network",
    messages=[{"role": "user", "content": "Hello!"}],
    max_tokens=200
)

print(response.choices[0].message.content)

Note: Hosted models count against your relay balance at the same rate as standard inference. Max 2,047 tokens per request. Logprobs are not supported.

// ai tools

Tools as POKT services

NodeGhost provides AI tools as first-class POKT services. Each tool call is a single relay — flat cost, no tokens consumed, no context window, completely stateless. Tools return structured data only, never raw web content, making them safe and prompt-injection resistant by design.

All tools require a valid ng- API key. Tool calls count against your relay balance — web search costs 1 relay at 50,000 CU, which is cheaper than inference (400,000 CU). This means your relay balance goes further when agents use search than when they run inference.

Tool	Endpoint	CU per relay	Status
Web search	`POST /v1/tools/search`	50,000	Live
Tool discovery	`GET /v1/tools`	Free	Live

Privacy note: All tool calls are stateless. No query is stored. No session is maintained. Each request is completely independent — the tool server has no memory of previous calls.

// ai tools

Web search

Search the web privately via Brave Search. Returns titles, URLs, and snippets. Routed through POKT decentralized network — the search engine never sees the agent's identity or IP address.

REQUEST

POST /v1/tools/search
Authorization: Bearer ng-yourkey
Content-Type: application/json

{
  "query": "your search terms",
  "count": 5
}

RESPONSE

{
  "query": "your search terms",
  "count": 3,
  "results": [
    {
      "title": "Result title",
      "url": "https://example.com",
      "snippet": "Brief description of the result..."
    }
  ]
}

Parameters

Parameter	Type	Required	Description
query	string	Yes	Search terms, max 400 characters
count	integer	No	Number of results, 1–20, default 5

Also supports GET requests: GET /v1/tools/search?q=your+query&count=5

Using web search as an agent tool

Web search is designed to be called by AI agents via the OpenAI tool calling format. Pass it as a tool definition in your inference request and the model will call it automatically when it needs current information:

curl https://nodeghost.ai/v1/chat/completions \
  -H "Authorization: Bearer ng-yourkey" \
  -H "X-Endpoint-Key: sk-yourmodelkey" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "your-model",
    "messages": [{"role": "user", "content": "What is the current price of POKT?"}],
    "tools": [{
      "type": "function",
      "function": {
        "name": "web_search",
        "description": "Search the web for current information",
        "parameters": {
          "type": "object",
          "properties": {
            "query": {"type": "string", "description": "Search terms"}
          },
          "required": ["query"]
        }
      }
    }]
  }'

Agentic payload size: When using web search in a multi-step agent loop, compress search results before adding them to the conversation history. The POKT relay protocol has a ~100KB payload limit — full JSON search results can exceed this in long conversations. Extract title, snippet, and URL as plain text instead of passing raw JSON.

// ai tools

Tool discovery

Query this endpoint to discover all available NodeGhost tools, their descriptions, parameters, and endpoints. Designed for autonomous agents that need to discover available capabilities without hardcoded configuration.

REQUEST

GET /v1/tools

RESPONSE

{
  "provider": "NodeGhost",
  "description": "Privacy-preserving AI tools routed through POKT",
  "cost": "1 POKT relay per tool call = 400,000 CU",
  "tools": [
    {
      "name": "web-search",
      "description": "Search the web privately via Brave Search...",
      "endpoint": "https://nodeghost.ai/v1/tools/search",
      "method": "POST",
      "parameters": { ... },
      "privacy": "Queries are never logged or stored."
    }
  ]
}

No authentication required for tool discovery. Authentication is required to call individual tools.

// faq

Frequently asked questions

Do you log my conversations?

+

No. NodeGhost does not log, store, or inspect the content of your AI requests. This is an architectural guarantee — requests are routed through a decentralized network where no single entity has visibility into what you're asking. We log only usage metadata (request count, timestamp, API key) for billing and rate limiting purposes, retained for 90 days.

What models does NodeGhost use?

+

NodeGhost is a bring-your-own-model gateway. Register any OpenAI-compatible endpoint — OpenAI, Groq, Together AI, Anthropic, or your own self-hosted model like Llama or Mistral — and NodeGhost routes your inference through the POKT decentralized network. You choose the model, NodeGhost handles the infrastructure.

Is NodeGhost compatible with Home Assistant?

+

Yes — NodeGhost works as a drop-in replacement for OpenAI inside the Extended OpenAI Conversation integration. Set the Base URL to https://nodeghost.ai/v1 with your ng- API key, and set the model name to match your registered endpoint. Register your model endpoint first at POST /v1/endpoint/register. See the Home Assistant setup guide above for step-by-step instructions.

How is this different from just using OpenAI directly?

+

Three main differences: (1) Privacy — requests route through a decentralized POKT network, not a centralized corporate server. (2) Price — Free tier available, paid plans start at $9/month vs OpenAI's $20/month. (3) Sovereignty — no single company controls the infrastructure your requests flow through.

What happens when I hit my request limit?

+

Your API key will return a 429 rate limit error until your quota resets on the 1st of the next month. You can upgrade your plan at any time to get more requests immediately. We don't charge overage fees — you just hit the limit and stop.

Can I use NodeGhost without Stripe or USDC?

+

Yes — crypto-native users can stake a POKT application wallet directly on Shannon mainnet and use the gateway without any subscription or payment. Stake 1,001 POKT against the ai-inference service, delegate to the NodeGhost gateway address, and hit https://nodeghost.ai/pokt/v1/chat/completions with your own model provider API key (in Authorization) and your application address (in X-App-Address). See the Stake your app wallet section above for full instructions.

What are the X-Endpoint and X-App-Address headers?

+

Both headers are required on the native POKT stake path (/pokt/v1/chat/completions). X-Endpoint tells NodeGhost which model provider to forward your request to — for example https://api.openai.com, https://api.deepseek.com, or https://api.groq.com/openai (defaults to OpenAI if omitted). X-App-Address is your staked POKT application's bech32 address (pokt1...) — it tells the gateway which of your delegated applications to claim the relay under, and is required as of 2026-05-17. NodeGhost never holds your model API key — it's passed directly through to your provider.

How is NodeGhost pricing determined?

+

NodeGhost pricing is designed to stay competitive with native POKT stake cost — the price anyone can access by staking directly on the network. We don't offer price locks because our infrastructure costs are tied to the POKT protocol, which can change. What we do promise is that Stripe pricing will always be close to native stake cost plus a small convenience premium for not needing crypto or a CLI. If you'd rather pay at the protocol level directly, see Stake your app wallet.

Can I use NodeGhost for commercial projects?

+

Yes — the Pro plan ($29/month, 29B CU) is sized for typical commercial use, and the Agent plan ($99/month, 99B CU, unlimited keys) covers higher-volume agent workloads. For custom arrangements beyond these tiers, contact monitee@gmail.com.

What is POKT Network and why does it matter?

+

Pocket Network (POKT) is a decentralized infrastructure protocol that coordinates thousands of independent node operators. NodeGhost is built on top of POKT's Shannon mainnet — meaning your requests are routed through a network of independent suppliers rather than servers owned by a single company. This is what makes the privacy guarantee architectural rather than just a policy promise.

Do you offer refunds?

+

Yes — we offer a 7-day refund window on all paid plans. If NodeGhost doesn't work for your use case within the first 7 days, contact monitee@gmail.com for a full refund.

I lost my API key — what do I do?

+

Email monitee@gmail.com from the address you signed up with and we'll reissue your key. For security, the old key will be revoked when we issue the new one.

// ghost chat

Ghost Chat

Ghost is NodeGhost's built-in private AI chat interface available at nodeghost.ai/ghost. It combines all three NodeGhost POKT services in one page — inference, web search, and vector memory — routing every request through the decentralized network.

Three POKT services per conversation: Every message may trigger up to three separate POKT relay calls — memory recall (vector-memory), web search (web-search), and inference (ai-inference) — all earning relay rewards.

To use Ghost, open the page and enter your ng- key, model, and base URL. These are saved in your browser's localStorage and persist across sessions.

// ghost chat

Memory & encryption

Scope: The client-side encryption described in this section applies only to personal memories created through the Ghost chat at nodeghost.ai/ghost, and only when you opt in to encryption on first use. It does not apply to the hosted knowledge base, the ng-memory npm package, the browser extension, or self-hosted ng-memory-server instances. See the comparison below for each surface's storage model.

When you opt in on the Ghost chat page, Ghost automatically remembers facts from your conversations and recalls them in future sessions. Personal memories are encrypted in your browser before being sent to the memory server — NodeGhost stores ciphertext only, and your password never leaves your browser.

Encryption is optional. If you choose “Skip encryption” on first use, no personal memories are stored at all — the page operates as a stateless chat interface with no recall.

How it works (Ghost chat only)

1

Create a memory password

On first use, Ghost prompts you to create a password. This password never leaves your browser — it's used to derive an AES-256-GCM encryption key using PBKDF2 (310,000 iterations, SHA-256).

2

Facts extracted client-side

After each response, your browser calls the model to extract key facts from the conversation. The fact-extraction prompt is sent to the inference endpoint, but the resulting facts are processed locally before storage.

3

Encrypted before storage

Each extracted fact is encrypted in your browser using your password-derived key (AES-256-GCM with a random per-fact IV). Only the base64-packed ciphertext is sent to the memory server. If you inspect the database, you'll only see encrypted blobs.

4

Decrypted locally on recall

When you send a new message, Ghost queries the memory server, receives the ciphertext blobs, and decrypts them locally in your browser. The plaintext context is injected into the system prompt of your inference call — the server never sees the decrypted text.

Password responsibility: Your memory password cannot be recovered. If you forget it, your encrypted memories are permanently inaccessible. There is no reset or recovery — this is by design. NodeGhost cannot decrypt your memories even if requested.

Memory feed indicators

Indicator	Meaning
`◈ X results from knowledge base`	Public business knowledge was recalled (plaintext, not encrypted)
`◈ X results from personal memory`	Your encrypted personal memories were recalled and decrypted locally
`◈ X results from knowledge base + personal memory`	Both sources contributed context
`🔒 Encrypted` badge in header	Encryption is active for this session

Memory surfaces compared

NodeGhost exposes several memory surfaces with different storage and trust models. Read this before choosing one.

Surface	Storage	Encrypted?
Ghost chat — personal memory `nodeghost.ai/ghost`	Ciphertext on operator VPS, sent via the `vector-memory` POKT service. Decrypted only in your browser.	Yes (AES-256-GCM, opt-in)
Hosted knowledge base `POST /v1/memory/upload`	Plaintext SQLite on operator VPS, per-org isolated. Operator has technical access to uploaded documents.	No
`ng-memory` npm package	Local SQLite file in Node, or browser IndexedDB. Memory data never leaves your device by default.	No (storage is local-only)
NodeGhost browser extension	`chrome.storage.local` in your browser. Memory data never leaves your device, but conversation turns are sent to the inference endpoint for fact extraction.	No
Self-hosted ng-memory-server	Same `ng-memory-server` software as the hosted KB, run on your own infrastructure. Plaintext SQLite. You control access.	No

If you need encrypted memory storage with a NodeGhost-hosted backend today, the Ghost chat page is the only path. Other surfaces store plaintext — pick the one whose threat model fits your use case (your own server, your own machine, or operator-trusted).

// ghost chat

Knowledge base

In addition to personal encrypted memories, Ghost automatically queries a shared knowledge-base namespace on every message. This namespace is public and readable by anyone with a valid ng- key — it's designed for business content like product docs, FAQs, and support guides.

Upload documents to the knowledge base using the Memory Admin Panel. The knowledge base is queried through POKT's vector-memory service — the same decentralized routing as personal memories.

No password needed: Knowledge base queries don't require an encryption password. Even users who skipped encryption setup will receive knowledge base context in their responses.

// memory server

ng-memory-server

ng-memory-server is a self-hosted RAG (Retrieval Augmented Generation) server that stores and retrieves memories using vector embeddings. It's the backend that powers Ghost's memory system and the knowledge base.

NodeGhost does not host memory servers — you run your own. This is a privacy decision: your memories never touch NodeGhost infrastructure. The memory server is accessed through POKT's vector-memory service.

Features

Feature	Details
Document ingestion	PDF, TXT, MD — auto-chunked and embedded
Vector search	Cosine similarity using local embeddings
Multi-tenant	Per-user namespace isolation via hashed ng- key
Zero-knowledge	Stores ciphertext — server never sees plaintext when using Ghost encryption
Public namespaces	`knowledge-base` namespace is publicly queryable

// memory server

Deploy ng-memory-server

Option 1 — Docker (recommended)

docker run -d \
  -p 3100:3100 \
  -v /your/data/path:/data \
  -e PORT=3100 \
  -e DATA_DIR=/data \
  -e INFERENCE_URL=https://nodeghost.ai/v1 \
  -e PUBLIC_NAMESPACES=knowledge-base \
  nodeghost/ng-memory-server

Option 2 — Node.js directly

git clone https://github.com/Monitee/ng-kit
cd ng-kit/ng-memory-server
npm install
PORT=3100 DATA_DIR=./data node server.js

Environment variables

Variable	Default	Description
`PORT`	3100	Port to listen on
`DATA_DIR`	./data	Path to store SQLite databases
`INFERENCE_URL`	https://nodeghost.ai/v1	Inference endpoint for fact extraction
`PUBLIC_NAMESPACES`	knowledge-base	Comma-separated namespaces accessible without auth

NodeGhost vector-memory service: Once your memory server is running, register it as a POKT supplier to earn relay rewards on every memory query. See the nodeghost-supplier repo for setup instructions.

// memory server

Memory Admin Panel

The admin panel is a web-based interface for managing your memory server. No SSH required — upload documents, manage sources, browse memories, and view namespaces from any browser.

Access

The admin panel is served by the memory server itself at /admin. If you're using NodeGhost's hosted memory API proxy:

https://nodeghost.ai/memory-admin

Log in with:

Field	Value
Server URL	`https://nodeghost.ai/memory-api` (or your own server URL)
API Key	Your `ng-` key

Uploading documents

1

Select namespace

Choose knowledge-base for public business content, or default for private personal content.

2

Drag and drop files

Supports PDF, TXT, and MD files. Multiple files can be uploaded at once. Files are automatically chunked and embedded.

3

Verify in Sources tab

After upload, switch to the Sources tab to confirm your documents are ingested. Each source shows the number of chunks stored.

// memory server

Memory API reference

All authenticated endpoints accept your ng- key in the JSON body as owner_token. The body field is the canonical form because the POKT relay path strips Authorization headers — if you're calling through nodeghost.ai/v1/memory/*, put the key in owner_token. Authorization: Bearer still works for direct memory-server calls but not via the relay.

POST /v1/memory/public/recall is the only unauthenticated endpoint — it reads the global public knowledge base and accepts no token.

Every memory operation costs 25,000 CU — much cheaper than inference (400,000 CU).

Recall personal memories

POST /v1/memory/recall

{
  "owner_token": "ng-your-key",
  "query": "what are the user's preferences?",
  "namespace": "default",
  "topK": 5
}

Store a memory

POST /v1/memory/store

{
  "owner_token": "ng-your-key",
  "text": "User prefers dark mode",
  "namespace": "default"
}

Public knowledge-base recall (no auth)

POST /v1/memory/public/recall

{
  "query": "what is your refund policy?",
  "topK": 5
}

Upload a document

Multipart form-data does not survive the POKT relay path — the relay miner strips the multipart Content-Type boundary, so multipart uploads fail. Use base64-encoded JSON instead:

POST /v1/memory/upload

{
  "owner_token": "ng-your-key",
  "filename": "product-manual.pdf",
  "content_base64": "JVBERi0xLjQK...",
  "namespace": "knowledge-base"
}

Supported types: PDF, TXT, MD. Files are auto-chunked and embedded.

Browse stored memories

POST /v1/memory/list         { "owner_token": "ng-your-key", "namespace": "default" }
POST /v1/memory/sources      { "owner_token": "ng-your-key", "namespace": "knowledge-base" }
POST /v1/memory/namespaces   { "owner_token": "ng-your-key" }
POST /v1/memory/stats        { "owner_token": "ng-your-key" }

Clear all memories

POST /v1/memory/clear

{
  "owner_token": "ng-your-key"
}

Deletes every chunk across all namespaces for your account. Cannot be undone.

// business proxy

Business Proxy

The NodeGhost business proxy lets you add AI to your website without exposing your ng- key to customers. Your customers chat naturally — no API keys, no setup, no friction. The proxy runs on your server and handles everything.

Customer experience: A customer visits your flower shop, types "do you have red roses?", and gets an accurate answer from your product catalog. They never see an API key, never install anything, never know NodeGhost exists.

How it works

Customer browser → your proxy server → NodeGhost → POKT Network
                              ↑
                    ng- key injected here
                    (customer never sees it)

The proxy automatically queries your knowledge base before every customer message, injects the relevant context, and forwards to NodeGhost. Point any OpenAI-compatible chat widget at your proxy URL.

// business proxy

Deploy the proxy

Option 1 — Docker (recommended)

docker run -d \
  -p 3200:3200 \
  -e NG_KEY=ng-your-key-here \
  -e MEMORY_URL=http://your-memory-server:3100 \
  -e SYSTEM_PROMPT="You are a helpful assistant for Acme Corp." \
  -e ALLOWED_ORIGINS=https://yourwebsite.com \
  nodeghost/proxy

Option 2 — Node.js directly

git clone https://github.com/Monitee/ng-kit
cd ng-kit/ng-proxy
NG_KEY=ng-your-key-here \
MEMORY_URL=http://your-memory-server:3100 \
SYSTEM_PROMPT="You are a helpful assistant." \
node ng-proxy-server.js

Point your chat widget at the proxy

Base URL:  http://your-server:3200/v1
API Key:   any-string  (proxy ignores it, uses NG_KEY internally)
Model:     deepseek-chat

Compatible widgets: Any OpenAI-compatible chat UI works — Open WebUI, LibreChat, Chatbot UI, Flowise, or any widget that accepts a custom base URL.

// business proxy

Proxy configuration

Variable	Required	Default	Description
`NG_KEY`	✅ Yes	—	Your NodeGhost API key
`PORT`	No	3200	Port to listen on
`NG_URL`	No	https://nodeghost.ai	NodeGhost base URL
`MEMORY_URL`	No	—	Your ng-memory-server URL
`MODEL`	No	deepseek-chat	Default model
`NAMESPACE`	No	knowledge-base	Memory namespace to query
`SYSTEM_PROMPT`	No	Generic prompt	Your custom system prompt
`ALLOWED_ORIGINS`	No	*	Comma-separated CORS origins
`MAX_TOKENS`	No	800	Max tokens per response

Health check

GET http://your-server:3200/health

{
  "status": "ok",
  "model": "deepseek-chat",
  "namespace": "knowledge-base",
  "memory": true,
  "timestamp": "..."
}

// browser extension

NodeGhost Memory Extension

The NodeGhost browser extension adds persistent memory to any webpage that uses the NodeGhost API. It intercepts requests to nodeghost.ai/v1/chat/completions, stores facts from conversations, and recalls relevant context automatically.

What it does

Feature	Details
Auto-recall	Injects relevant memories into every chat request automatically
Auto-store	Extracts and stores key facts after each response
Keyword search	Fast keyword-based recall using chrome.storage
Works on nodeghost.ai	Active on the Ghost chat page and any page using the NodeGhost API
Privacy-first	Memories stored locally in your browser — never sent anywhere without your request

// browser extension

Install the extension

The extension is currently available as a developer install (Chrome/Edge). A Chrome Web Store listing is coming soon.

1

Download the extension

Clone or download ng-kit from github.com/Monitee/ng-kit and find the ng-extension/ folder.

2

Open Chrome extensions

Navigate to chrome://extensions in your browser and enable Developer Mode (top right toggle).

3

Load unpacked

Click "Load unpacked" and select the extension/ folder from the repo. The NodeGhost ghost icon will appear in your toolbar.

4

Enter your ng- key

Click the extension icon and enter your ng- key. The extension will start capturing and recalling memories automatically on nodeghost.ai.

Note: The extension works on nodeghost.ai only due to CORS restrictions. For use on your own domain, use the business proxy which handles memory server-side.

Setup guides & FAQ

Get your API key

Using the API

Path 1 — ng- key (Stripe or USDC)

Path 2 — Python (ng- key)

Path 3 — Native POKT stake

Supported model providers

Plans & CU metering

Plans

CU per call

Home Assistant overview

Local speech-to-text (Whisper)

NodeGhost AI (via POKT)

Local text-to-speech (Piper)

Install the integration

Step 1 — Install HACS

Step 2 — Install Extended OpenAI Conversation

Open HACS in Home Assistant

Restart Home Assistant

Add the integration

Configure NodeGhost credentials

Set as your conversation agent

Add local voice (Whisper + Piper)

Install Whisper (speech to text)

Go to Settings → Add-ons → Add-on Store

Start the add-on and enable on boot

Add the Wyoming integration

Install Piper (text to speech)

Remote access

Option A — Cloudflare Tunnel (free)

Get a free domain or use one you own

Install the Cloudflare Tunnel add-on in HA

Point the HA app at your tunnel URL

Option B — Nabu Casa ($6.50/month)

Run your own AI model through NodeGhost

Why run your own model?

How it works

Register your endpoint

Step 1 — Run an OpenAI-compatible model server

Step 2 — Register your endpoint with NodeGhost

Step 3 — Use NodeGhost as normal

Check your routing status

Remove your custom endpoint

Full privacy stack — zero third parties

The full stack

What this means for privacy

Recommended models for self-hosting

What is POKT?

Stake your app wallet

What you'll need

Stake your application

Using the gateway after staking

The X-App-Address header

USDC on Base

Hosted models

Text generation

Example request (curl)

Example request (Python)

Tools as POKT services

Web search

Parameters

Using web search as an agent tool

Tool discovery

Frequently asked questions

Ghost Chat

Memory & encryption

How it works (Ghost chat only)

Create a memory password

Facts extracted client-side

Encrypted before storage

Decrypted locally on recall

Memory feed indicators

Memory surfaces compared

Knowledge base

ng-memory-server

Features

Deploy ng-memory-server

Option 1 — Docker (recommended)

Option 2 — Node.js directly

Environment variables