Front-Line Deployment & Integration: Lead the end-to-end installation, configuration, and production deployment of AI Guardrail products within clients' specific infrastructure boundaries (On-premise, VPC, or hybrid cloud). Connect core products with clients' existing systems, including internal APIs, databases, and security management platforms. - Environment Deployment & Troubleshooting: Optimize containerized configurations and performance within client environments (on-prem servers, private clouds, K8s clusters). Take ownership of troubleshooting, with a particular focus on GPU resource scheduling and restricted/network-isolated environments. - Scenario-Based Solution Development: Design and implement data processing pipelines and automation scripts tailored to clients' unique business scenarios, specific AI models, or Agent execution frameworks (e.g., Harness). Automate repetitive tasks to increase efficiency during client onboarding. - Technical Trust, Consulting & Training: Serve as the technical bridge between enterprise clients and internal product teams. Communicate closely with client tech and security leads to explain AI vulnerability defense mechanisms, address compliance concerns, and translate security policies into engineering requirements. Conduct technical training sessions to ensure smooth handoffs. - Product Feedback & Engineering Feed-Forward: Collaborate closely with the engineering team to improve reliability and operational efficiency. Translate real-world pain points, extreme edge cases, client feedback, and performance bottlenecks encountered on the front line into actionable product specifications to optimize our core products. ###
Requirements
Deployment Expertise: Strong, hands-on experience in executing client-side/on-premise deployments is highly essential. - Infrastructure & Operations: Highly proficient in container technologies (Docker, Docker Compose) and orchestration tools. Hands-on experience with Kubernetes (K8s) deployment and troubleshooting (e.g., Pod scheduling, network configuration, Storage Class). Proficiency in automation platforms like Ansible and Terraform. - Programming & Architecture: Proficiency in programming languages such as Python (with the ability to quickly read/refactor code), Bash, or Go, backed by strong system architecture thinking. Experience with AI pair programming tools (e.g., OpenAI). - Networking & Troubleshooting: In-depth understanding of network concepts (security, firewalls, VPNs, proxy configurations) in enterprise environments. Excellent log analysis and Linux system troubleshooting abilities, capable of independently identifying root causes in restricted or offline client environments. - Monitoring Solutions: Advanced knowledge of monitoring and alerting solutions (Prometheus, Grafana, ELK stack) to track deployment health and system reliability. - Soft Skills & Travel: Exceptional client-facing communication and stakeholder management skills. Ability to travel and accommodate short-term on-site stints or local deployments at client premises (e.g., large enterprises, financial institutions, high-tech manufacturing in Asia, UAE, etc.) based on project needs.