Position Title: Systems Administrator Team Lead Location: Ottawa, ON (377 Dalhousie Street) Work Model: Hybrid - 4 days onsite, 1 day work from home
About Rebel OUR CUSTOMERS BRING A VISION - WE BRING THE PLATFORM TO SHARE IT ONLINE.
We believe that those who contribute make us better. It’s why we create simple, useful tools to empower participation in the world’s bravest communication space: the Internet. We are experts in domain names and the products that make the most of them. This helps our customers showcase their ideas, stories, services and contributions to the world. Our manifesto: Be Thoughtful, Be Simple, Be Brave.
Role Overview We’re hiring a Systems Administrator Team Lead to lead our Systems Team, responsible for reliable, secure, and scalable operations across IT, cloud/server infrastructure, hosting platforms, and Live Production Support (LPS). You’ll combine hands-on systems administration and platform engineering with people leadership—setting priorities, improving processes, and ensuring strong service delivery and operational excellence. This role includes participation in an LPS on-call rotation, with after-hours support when required to protect production systems and services.
What You’ll Do Lead and develop a small Systems Team
Coach, mentor, and support team members through regular 1:1s, feedback, and growth plans.
Set clear ownership, expectations, and escalation paths, including on-call coverage.
Run the team using agile practices
Act as the team’s Scrum Master: facilitate planning, standups, retrospectives, and backlog refinement.
Manage intake and prioritization with stakeholders to balance roadmap work, technical debt, and urgent operational needs.
Own IT Operations
Oversee user lifecycle and access management, endpoint tooling, and operational standards.
Improve documentation, runbooks, and repeatable processes for common requests and incidents.
Drive security operations
Support security controls and operational hygiene (patching, vulnerability management, incident response readiness).
Promote least-privilege access, secure configurations, auditing, and continuous improvement.
Manage cloud and server infrastructure
Administer and improve AWS and other infrastructure platforms (IAM, networking, compute, storage, monitoring).
Maintain Windows and Linux environments, ensuring stability, patching, hardening, and automation.
Own hosting platforms
Operate and improve hosting environments including Plesk and WordPress, focusing on uptime, performance, and security.
Standardize deployments, upgrades, backups, and troubleshooting processes.
Advance platform engineering / DevOps
Improve delivery and operational workflows using Git and modern DevOps practices.
Increase automation and repeatability (scripting, infrastructure-as-code patterns, CI/CD improvements where applicable).
Ensure high availability and disaster readiness
Maintain and test backups, DR plans, and recovery procedures.
Track availability, lead incident reviews/root cause analysis, and implement reliability improvements.
Provide Live Production Support (LPS)
Participate in LPS to ensure production stability and rapid incident response.
Join an on-call rotation and provide after-hours support when required for production incidents and escalations.
Improve LPS effectiveness through better monitoring, alerting, runbooks, and incident workflows.
What You Bring Team leadership & people management
Experience leading a small technical team with strong coaching, communication, and accountability.
Ability to balance hands-on technical work with planning, delegation, and stakeholder alignment.
Scrum Master / agile delivery
Proven ability to run agile ceremonies, maintain a prioritized backlog, and drive continuous improvement.
Systems administration (Windows & Linux)
Strong operational experience administering and troubleshooting Windows and Linux systems.
Familiar with patching, hardening, identity/access controls, and automation best practices.
Understanding of cloud-native operational patterns (scalable design, resilience, monitoring, automation).
Platform engineering / DevOps & Git
Experience with DevOps practices and tooling to improve reliability and delivery.
Comfort with Git workflows and infrastructure/platform change management.
Hosting (Plesk / WordPress)
Hands-on experience managing Plesk and WordPress hosting environments, including upgrades, security, backups, and performance troubleshooting.
Disaster recovery & availability management
Experience designing and operating DR processes, backups, and availability improvements.
Ability to run incident response, lead post-incident reviews, and reduce recurrence through RCA and follow-through.
Live Production Support (LPS) / on-call readiness
Comfortable supporting production systems under pressure with clear communication and structured incident management.
Willingness to participate in an on-call rotation and provide after-hours support when required.
What We Offer The opportunity to work in an atmosphere that truly rewards hard work and creative thinking. We offer a competitive salary, benefits, and opportunities for growth and advancement within our company. As if that wasn’t enough we also offer a smoke-free environment, a downtown location, a fully stocked fridge free for all staff. If Rebel sounds like the perfect workplace for you, there is only one question- What are you waiting for?
About This Role This role represents an existing vacancy.
Compensation CAD $90,000 - $130,000 annually, plus benefits.
How We Hire As part of this recruitment process, we use automated or artificial intelligence–enabled tools to support the screening and assessment of candidates’ applications. All hiring decisions are made by our team.