Skip to content

FineYoungCannibals/ppp_connectors

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

ppp_connectors

A clean, modular set of Python connectors and utilities for working with both APIs and DBMS backends, unified by a centralized Broker abstraction and a consistent interface. Designed for easy testing, code reuse, and plug-and-play extensibility.

πŸ“š Table of Contents


πŸ“¦ Installation

pip install ppp-connectors

Copy the .env.example to .env for local development:

cp dev_env/.env.example dev_env/.env

Environment variables are loaded automatically via the combine_env_configs() helper.


πŸ”Œ API Connectors

All API connectors inherit from the shared Broker base class and:

  • Accept API credentials via env vars or constructor args
  • Send requests via get, post, etc.
  • Include support for logging, retry/backoff, and VCR integration

Example (URLScan)

from ppp_connectors.api_connectors.urlscan import URLScanConnector

scanner = URLScanConnector(load_env_vars=True)
result = scanner.scan(
    url="https://example.com",
    visibility="public",
    tags=["example", "demo"],
    custom={"foo": "bar"},  # Arbitrary API field passed via **kwargs
    headers={"X-Custom-Header": "my-value"}  # httpx headers
)
print(result.json())

Customizing API Requests with **kwargs

All connector methods accept arbitrary keyword arguments using **kwargs. These arguments are passed directly to the underlying httpx request methods, enabling support for any feature available in httpx β€” including custom headers, query parameters, timeouts, authentication, and more. Additionally, for APIs that accept arbitrary fields in their request body (like URLScan), these can also be passed as part of **kwargs and will be merged into the outgoing request. This enables full control over how API requests are constructed without needing to modify connector internals.

Example (URLScan with custom headers and params)

result = scanner.scan(
    url="https://example.com",
    visibility="unlisted",
    headers={"X-Custom-Header": "my-value"},
    params={"pretty": "true"}
)

This pattern allows flexibility without needing to subclass or modify the connector.

Proxy Awareness

API connectors inherit from the Broker class and support flexible proxy configuration for outgoing HTTP requests. You can set proxies in multiple ways:

  • a single proxy parameter (applies to all requests),
  • a per-scheme mounts parameter (e.g., separate proxies for http and https as a dictionary),
  • or environment variables (from .env or OS environment, specifically HTTP_PROXY and HTTPS_PROXY).

Proxy precedence: mounts > proxy > environment source (.env via load_env_vars=True, else OS environment if trust_env=True) > none.

  • If you provide explicit mounts, these override all other proxy settings.
  • If you set proxy, it overrides environment proxies but is overridden by mounts.
  • If neither is set, and load_env_vars=True, proxy settings are loaded from .env via combine_env_configs().
    • If both .env and OS environment have the same variable, OS environment takes precedence.
  • If no explicit proxy or mounts are set but trust_env=True, HTTPX will use OS environment proxy settings (including NO_PROXY).

Examples:

Using a single proxy:

from ppp_connectors.api_connectors.urlscan import URLScanConnector
conn = URLScanConnector(proxy="http://myproxy:8080")

Using per-scheme mounts:

conn = URLScanConnector(mounts={"https://": "http://myproxy:8080", "http://": "http://myproxy2:8888"})

Loading proxy from .env:

# .env file contains: HTTP_PROXY="http://myproxy:8080"
conn = URLScanConnector(load_env_vars=True)
# Uses HTTP_PROXY from .env even if not in OS environment.

Note: Any changes to proxy settings require re-instantiating the connector for changes to take effect.

SSL Verification and Per-Request Options

You can now pass any httpx.Client keyword arguments (such as verify=False, http2=True) when instantiating a connector. These options will be applied to all requests made by that connector.

Additionally, per-request keyword arguments can be passed to methods like .get(), .post(), etc., and will be forwarded to httpx.Client.request for that single call.

Setting verify=False disables SSL verification and can be useful for testing against servers with self-signed certificates, but should not be used in production unless you understand the security implications.

Examples:

Disable SSL verification at the connector level:

conn = URLScanConnector(verify=False)
response = conn.get("https://self-signed.badssl.com/")
print(response.status_code)

Disable SSL verification for a single request:

conn = URLScanConnector()
response = conn.get("https://self-signed.badssl.com/", verify=False)
print(response.status_code)

Enable HTTP/2:

conn = URLScanConnector(http2=True)
response = conn.get("https://nghttp2.org/httpbin/get")
print(response.http_version)

πŸ—ƒοΈ DBMS Connectors

Each database connector follows a class-based pattern and supports reusable sessions, query helpers, and in some cases bulk_insert.

MongoDB

from ppp_connectors.dbms_connectors.mongo import MongoConnector

conn = MongoConnector("mongodb://localhost:27017", username="root", password="example")
conn.bulk_insert("mydb", "mycol", [{"foo": "bar"}])

Elasticsearch

# The query method returns a generator; use list() or iterate to access results
from ppp_connectors.dbms_connectors.elasticsearch import ElasticsearchConnector

conn = ElasticsearchConnector(["http://localhost:9200"])
results = list(conn.query("my-index", {"query": {"match_all": {}}}))
for doc in results:
    print(doc)

ODBC (e.g., Postgres, Teradata)

For automatic connection handling, use ODBCConnector as a context manager

from ppp_connectors.dbms_connectors.odbc import ODBCConnector

with ODBCConnector("DSN=PostgresLocal;UID=postgres;PWD=postgres") as db:
   rows = conn.query("SELECT * FROM my_table")
   print(list(rows))

If you'd like to keep manual control, you can still use the .close() method

from ppp_connectors.dbms_connectors.odbc import ODBCConnector

conn = ODBCConnector("DSN=PostgresLocal;UID=postgres;PWD=postgres")
rows = conn.query("SELECT * FROM my_table")
print(list(rows))
conn.close()

Splunk

from ppp_connectors.dbms_connectors.splunk import SplunkConnector

conn = SplunkConnector("localhost", 8089, "admin", "admin123", scheme="https", verify=False)
results = conn.query("search index=_internal | head 5")

πŸ§ͺ Testing

βœ… Unit tests

  • Located in tests/<connector_name>/test_unit_<connector>.py
  • Use mocking (MagicMock, patch) to avoid hitting external APIs

πŸ” Integration tests

  • Use VCR.py to record HTTP interactions
  • Cassettes stored in: tests/<connector_name>/cassettes/
  • Automatically redact secrets (API keys, tokens, etc.)
  • Marked with @pytest.mark.integration
pytest -m integration

🧼 Suppress warnings

Add this to pytest.ini:

[pytest]
markers =
    integration: marks integration tests

πŸ§‘β€πŸ’» Contributing / Adding a Connector

To add a new connector:

  1. Module: Place your module in:

    • ppp_connectors/api_connectors/ for API-based integrations
    • ppp_connectors/dbms_connectors/ for database-style connectors
  2. Base class:

    • Use the Broker class for APIs
    • Use the appropriate DBMS connector template for DBMSs
  3. Auth: Pull secrets using combine_env_configs() to support .env, environment variables, and CI/CD injection.

  4. Testing:

    • Add unit tests in: tests/<name>/test_unit_<connector>.py
    • Add integration tests in: tests/<name>/test_integration_<connector>.py
    • Save cassettes in: tests/<name>/cassettes/
  5. Docs:

    • Add an example usage to this README.md
    • Document all methods with docstrings
    • Ensure your connector supports logging if enable_logging=True is passed
  6. Export:

    • Optionally expose your connector via __init__.py for easier importing

πŸ› οΈ Dev Environment

git clone https://github.com/FineYoungCannibals/ppp_connectors.git
cd ppp_connectors

cp .env.example .env

python -m venv .venv
source .venv/bin/activate

poetry install  # if using poetry, or `pip install -e .[dev]`

pytest           # run all tests
black .          # format code
flake8 .         # linting

πŸ” Secrets and Redaction

Sensitive values like API keys are redacted using the AUTH_PARAM_REDACT list in conftest.py. This ensures .yaml cassettes don’t leak credentials.

Redacted fields include:

  • Query/body fields like api_key, key, token
  • Header fields like Authorization, X-API-Key
  • URI query parameters

βœ… Summary

  • Centralized request broker for all APIs
  • Robust DBMS connectors
  • Easy-to-write unit and integration tests with automatic redaction
  • Environment-agnostic configuration system
  • VCR-powered CI-friendly test suite

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Packages

No packages published

Languages