Observability Lead at Nsano
Full Time Job @Ghana Careers 3 posted 2 weeks ago in Engineering , in IT & Telecoms Share this jobJob Detail
- Job ID 43945
- Career Level Others
- Experience 5 Years
- Gender Any
- Qualifications Bachelor's Degree
Job Description
Observability Lead
Location: Accra, Ghana
Employment Type: Full-Time
Application Deadline: 25 May 2026
Company Profile
Nsano is a leading fintech company delivering secure and innovative digital payment solutions across Africa. We empower businesses through reliable financial technology infrastructure, seamless integrations, and scalable payment services. At Nsano, we value innovation, operational excellence, and technology-driven impact.
We are looking for an experienced and proactive Observability Lead to drive the company’s observability strategy, strengthen platform reliability, and ensure real-time visibility across critical systems and services.
Latest Career Advice
- How to Ask Smart Questions at the End of a Job Interview in Ghana May 24, 2026
- What to Say When You Don’t Know the Answer in a Job Interview May 20, 2026
- The Biggest CV Lies Ghanaian Recruiters Instantly Detect (And Why They Reject Such Candidates) May 11, 2026
- The Soft Skills Crisis in Ghana: Why Qualified Professionals Are Still Struggling to Get Hired May 8, 2026
Role Summary
As the Observability Lead, you will oversee enterprise-wide monitoring, alerting, incident detection, and operational intelligence across infrastructure, applications, cloud environments, and production systems. You will work closely with Engineering, DevOps, Security, and Product teams to improve reliability, reduce downtime, and enhance overall platform performance.
Key Responsibilities
Observability Strategy & Ownership
- Develop and lead the company-wide observability strategy across infrastructure, applications, cloud platforms, databases, and internal services.
- Establish monitoring standards, governance frameworks, and best practices for production workloads.
- Ensure real-time visibility into system health, performance, availability, and capacity.
- Drive a proactive reliability culture through data-driven monitoring and operational insights.
Monitoring & Alerting Management
- Ensure comprehensive monitoring coverage across all critical production services.
- Design, configure, and maintain dashboards, logs, metrics, alerts, and distributed tracing systems.
- Continuously optimize alert thresholds to reduce noise and eliminate false positives.
- Maintain centralized monitoring systems accessible to relevant technical teams.
Incident Detection & Operational Response
- Ensure operational incidents are identified internally before customer impact whenever possible.
- Lead incident response activities during outages, degradations, and system anomalies.
- Coordinate cross-functional teams throughout incident resolution processes.
- Conduct post-incident reviews, root cause analysis (RCA), and corrective action planning.
Performance Monitoring & Optimization
- Monitor system latency, throughput, resource utilization, and application performance metrics.
- Identify performance bottlenecks and collaborate with engineering teams on remediation initiatives.
- Support load readiness assessments, scaling strategies, and capacity planning activities.
- Continuously improve platform stability, responsiveness, and operational efficiency.
Reporting & Operational Insights
- Prepare weekly and monthly reports on system health, uptime, incident trends, and operational risks.
- Develop executive dashboards that provide leadership visibility into platform performance.
- Use operational intelligence and monitoring data to recommend strategic improvements and investment priorities.
Collaboration & Leadership
- Partner with Engineering, DevOps, Security, and Product teams to embed observability into all deployments and production processes.
- Support teams with troubleshooting, diagnostics, and production readiness reviews.
- Mentor engineers on monitoring best practices, reliability engineering, and observability tooling.
- Serve as the subject matter expert for observability, reliability monitoring, and operational intelligence.
Requirements
Education & Experience
- Bachelor’s degree in Computer Science, Information Technology, Engineering, or a related field.
- Minimum of 5 years’ experience in Observability, Site Reliability Engineering (SRE), DevOps, Infrastructure Monitoring, or Production Operations.
- Experience within fintech, payments, telecom, banking, or other mission-critical environments is preferred.
Technical Skills
- Hands-on experience with observability and monitoring tools such as Grafana, Prometheus, Datadog, New Relic, Signoz, ELK Stack, Splunk, AppDynamics, or similar platforms.
- Strong understanding of metrics, logs, traces, and alerting systems.
- Experience managing Linux servers, cloud platforms (AWS, Azure, or GCP), and containerized environments.
- Solid understanding of networking, databases, APIs, and distributed systems architecture.
- Scripting experience with Python, Bash, or similar languages is an advantage.
Soft Skills
- Strong analytical thinking and troubleshooting capabilities.
- Ability to remain calm and decisive during incidents and high-pressure situations.
- Excellent communication and stakeholder management skills.
- Strong leadership mindset with ownership, accountability, and initiative.
- High attention to detail and commitment to continuous improvement.
What Success Looks Like in This Role
- Production issues are identified and resolved before customers experience disruption.
- Leadership maintains real-time confidence in platform health, uptime, and operational stability.
- Engineering teams rely on actionable alerts, strong dashboards, and reliable operational insights.
- System performance continuously improves through proactive, data-driven optimization.
- Downtime and recurring incidents reduce significantly over time.
What We Offer
- Opportunity to lead observability and reliability initiatives within a fast-growing fintech environment.
- Collaborative and innovation-driven work culture.
- Professional growth and leadership development opportunities.
- Competitive compensation and benefits package.
How to Apply
Apply online by clicking on the application button.
Other jobs you may like
-
Heavy-Duty Truck (Trailer) Driver at Kasapreko PLC
- @ Kasapreko Company Limited
- Accra, Greater Accra, Ghana
