AI Developer (Python)
Posted 2025-09-12
Remote, USA
Full Time
Immediate Start
This is a remote position.
SUMMARYThe AI Developer (Python) role focuses on designing, developing, and optimizing AI/ML solutions in local-first environments, prioritizing data security, offline accessibility, and system autonomy. This role is ideal for a Python developer with a deep understanding of embedded storage systems, offline inference, and graph-based data models, who thrives in non-cloud, privacy-focused setups. The ideal candidate brings hands-on expertise in model training, local storage, and graph computation—supporting projects that require powerful offline data science solutions.
JOB RESPONSIBILITIES
- Design and implement AI/ML pipelines using Python with local storage architecture.
- Handle local model training, versioning, optimization, and offline inference using frameworks such as scikit-learn, XGBoost, LightGBM, and optionally PyTorch or TensorFlow.
- Store and process data using SQLite, DuckDB, and flat file formats (CSV, JSON, Parquet, Pickle).
- Manage secure local model and data caching for privacy-compliant workflows.
- Build and manage local graph databases using Neo4j Community Edition, applying Cypher queries for relationship analysis and entity resolution.
- Implement dimensionality reduction techniques such as PCA, t-SNE, or UMAP to streamline local ML performance.
- Ensure secure handling of sensitive data using encrypted local storage, with awareness of HIPAA, GDPR, and other compliance standards.
- Automate offline AI workflows using Python scripting, cron jobs, or Windows Task Scheduler.
- Document development processes, model lifecycle, and preprocessing techniques.
- (Optional) Develop lightweight local web interfaces with Flask or FastAPI.
- (Optional) Deploy models on edge devices like Raspberry Pi or Jetson Nano.
QUALIFICATIONS
- Bachelor’s degree in Computer Science, Data Science, Engineering, or equivalent work experience.
- 5+ years of Python development experience, with strong skills in writing clean, modular code.
- Proven experience with:
- scikit-learn, XGBoost, LightGBM.
- Local-first data workflows and embedded DBs (SQLite, DuckDB).
- Local model saving/loading with joblib, pickle, torch.save, etc.
- Neo4j graph database and Cypher query language.
- Secure offline environments without external API dependencies.
- Strong understanding of:
- Data preprocessing (handling nulls, normalization, feature engineering).
- Local model inference optimization (CPU/GPU).
- Data privacy regulations (HIPAA, GDPR).
- Git version control.
- Proficiency with Python scripting for automation.
- Comfort working in cloud-disconnected environments.
- Bonus: Experience with NetworkX, ArangoDB, Dgraph, or graph-tool.
- Bonus: Background in edge/IoT AI deployment.
JOB REQUIREMENTS
- Should be willing to accept a long-term work-from-home arrangement.
- Should be amenable to a permanent night shift schedule.