Deploy the Connector
Run the Pauhu® data connector on your own infrastructure. Docker, Kubernetes, or Azure Arc.
Prerequisites
1 Subscribe
Get your API key from staging.pauhu.eu/pricing or Azure Marketplace. Your key will be emailed after checkout.
2 Runtime
Docker 20.10+, Podman, or any OCI-compatible container runtime.
3 Resources
4 GB RAM minimum (8 GB recommended). 50 GB storage for the full annotated dataset.
Quick Start
Pull and run
# Pull the connector image
docker pull pauhu/data:latest
# Run with your API key
docker run -d \
--name pauhu-connector \
-p 8003:8003 \
-e PAUHU_API_KEY="pk_..." \
-e PAUHU_SOURCES="eurlex,ted,iate" \
-e PAUHU_LANGUAGES="en,fi,sv,de,fr" \
-v pauhu-data:/data \
pauhu/data:latest
Verify
# Health check
curl http://localhost:8003/health
# Query a source
curl http://localhost:8003/v1/eurlex/latest?limit=5
Configuration
| Variable | Required | Description |
|---|---|---|
PAUHU_API_KEY | Yes | Your API key from checkout (pk_...) |
PAUHU_SOURCES | No | Comma-separated source list. Default: all 20 sources. |
PAUHU_LANGUAGES | No | Comma-separated language codes. Default: all 24 EU languages. |
PAUHU_SYNC_INTERVAL | No | Sync frequency in minutes. Default: 15 |
PAUHU_PORT | No | HTTP port. Default: 8003 |
PAUHU_DATA_DIR | No | Data directory. Default: /data |
Available Sources
Use any combination of these source identifiers in PAUHU_SOURCES:
eurlex EUR-Lex regulations, directives, decisions
ted Tenders Electronic Daily (public procurement)
iate IATE terminology (2.4M multilingual terms)
eurostat Eurostat statistical datasets
curia Court of Justice case law
ecb European Central Bank decisions
commission European Commission documents
consilium Council of the EU documents
europarl European Parliament resolutions
publications EU Publications Office catalog
dataeuropa EU open data portal
echa European Chemicals Agency
ema European Medicines Agency
epo European Patent Office
oeil Legislative Observatory
cordis EU research projects
whoiswho EU institutional directory
dpp Digital Product Passport classifications
lex National law databases (28 countries)
wiki Wikidata EU entity graph
Eclipse Dataspace Connector
For connector-to-connector data exchange via the European Language Data Space, the Pauhu connector supports the Eclipse Dataspace Protocol.
# Enable LDS protocol
docker run -d \
--name pauhu-connector \
-p 8003:8003 \
-p 8282:8282 \
-e PAUHU_API_KEY="pk_..." \
-e PAUHU_LDS_ENABLED="true" \
-e PAUHU_LDS_PARTICIPANT_ID="your-lds-participant-id" \
-v pauhu-data:/data \
pauhu/data:latest
Port 8282 serves the Eclipse Dataspace Protocol endpoints. Register your connector in the LDS Participant Registry to enable contract negotiation with other participants.
Kubernetes
apiVersion: apps/v1
kind: Deployment
metadata:
name: pauhu-connector
spec:
replicas: 1
selector:
matchLabels:
app: pauhu-connector
template:
metadata:
labels:
app: pauhu-connector
spec:
containers:
- name: connector
image: pauhu/data:latest
ports:
- containerPort: 8003
env:
- name: PAUHU_API_KEY
valueFrom:
secretKeyRef:
name: pauhu-secrets
key: api-key
- name: PAUHU_SOURCES
value: "eurlex,ted,iate"
volumeMounts:
- name: data
mountPath: /data
volumes:
- name: data
persistentVolumeClaim:
claimName: pauhu-data-pvc
Support
Questions about deployment? Contact support@pauhu.eu or check the API documentation.