SysOps Engineer
ActionIQ unifies customer data and empowers marketers to deliver relevant customer experiences. Our product features self-service audience discovery and true cross-channel orchestration powered by AI-driven insights and decisions. This product and the platform it operates on is a complex and feature-rich distributed data system which we offer as a multi-cloud SaaS solution to our clients.
ActionIQ is looking for a Systems Operations (SysOps) Engineer to work in our newly formed SysOps Team. Our ideal candidate has prior experience working in a production support or operations team, and understands what it takes to operate a SaaS product.
As a member of the SysOps team, you will be the bridge that allows for seamless collaboration between SysOps, Engineering and Customer Engagement. Your impact will be felt throughout the company, visible both internally and externally. As a critical team in a startup, your input on improvements to the tools and processes used by your team will be valued - and expected!
Tools used by the team include
- AWS & Google Cloud Platform
- Atlassian suite(Confluence, Jira, Statuspage)
- Git
- Linux command-line utilities and BASH
- Datadog and Prometheus
- PagerDuty
- Python
What You'll Do
- Support cloud infrastructure and automation in collaboration with multiple software teams
- Troubleshoot and resolve challenging operational incidents and issues.
- Participate in a monthly on-call rotation.
- Collaborate with internal stakeholders at all levels of technical skill.
- Work with the SysOps Manager to define and report on key performance indicators.
- Identify gaps in information flow that impact our time to recovery(MTTR) and time to detection(MTTD).
- Maintain compliance with security standards and audit requirements.
- Clearly document Incident response processes and tools.
- Help ActionIQ continue to get better at incident management and response by leading After Action Reviews.
- Maintain a status page, providing timely and relevant information to our customers.
- Use telemetry and monitoring tools to communicate status to the rest of the organization.
Requirements
- Experience in performing system administration duties in cloud environments
- Basic command line linux skills
- Excellent spoken and written communication skills
- Basic network troubleshooting skills
- Excellent general problem solving and troubleshooting skills
- Strong interest in the areas of DevOps, Site Reliability Engineering, Incident Response, Resilience Engineering, and Technical Operations.
Nice to Have
- Experience automating processes.
- Experience working in a startup environment highly desired.
- Experience supporting an enterprise SaaS product.
- Experience working in an SLA-driven environment.
- Experience operating within compliance and security frameworks like SOC 2, ISO 27001, HIPAA, NIST 800-53, or similar.
Benefits
ActionIQ is committed to building an inclusive, equitable, and diverse organization. We embrace equal opportunity for all applicants and seek to foster a culture of belonging for our employees. We recognize and appreciate that the more inclusive we are, the better we will function as a team. AIQ welcomes qualified applicants of any race, color, ancestry, religion, sex, national origin, gender identity, gender expression, age, marital or family status, disability, military veteran status, and any other status or background. Join us on our journey to build a product that will help our customers deliver memorable experiences that will drive loyalty and growth.