Job Description
Fully Remote | Any Time Zone
We are seeking a highly skilled Infrastructure Engineer to serve as the primary technical owner for a rapidly growing application environment. This role is critical in ensuring performance, reliability, and scalability across a hybrid infrastructure (on-prem and Azure). As the app owner from an infrastructure and observability perspective, this engineer will be accountable for identifying, predicting, and preventing issues before they impact the front end or end users.
This role is especially focused on AppDynamics-building dashboards, running performance reports, and developing predictive insights to proactively address capacity, performance, and infrastructure risks as the application continues to scale.
Key Responsibilities
Act as the infrastructure owner for the application, ensuring visibility across both front end and back end systems
Own AppDynamics configuration and reporting, including:
o Creating and maintaining dashboards specific to application growth
o Monitoring performance, capacity, and infrastructure health
o Delivering regular reports highlighting risks, trends, and upcoming constraints
Provide proactive, predictive insights-for example, identifying when increased usage or growth will cause performance degradation or outages if not addressed
Partner closely with application owners and technical leads to surface backend issues before they impact users
Support and optimize the database server farm, infrastructure performance, and system stability
Identify gaps in monitoring and observability, and continuously improve visibility across systems
Reduce reliance on application technical leads for infrastructure performance tracking by fully owning observability and reporting in AppDynamics
We are a company committed to creating diverse and inclusive environments where people can bring their full, authentic selves to work every day. We are an equal opportunity/affirmative action employer that believes everyone matters. Qualified candidates will receive consideration for employment regardless of their race, color, ethnicity, religion, sex (including pregnancy), sexual orientation, gender identity and expression, marital status, national origin, ancestry, genetic factors, age, disability, protected veteran status, military or uniformed service member status, or any other status or characteristic protected by applicable laws, regulations, and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or recruiting process, please send a request to HR@insightglobal.com.To learn more about how we collect, keep, and process your private information, please review Insight Global's Workforce Privacy Policy: https://insightglobal.com/workforce-privacy-policy/.
Skills and Requirements
5+ years of infrastructure or site reliability engineering experience
Strong hands-on experience with AppDynamics, including dashboard creation, alerting, reporting, and analysis
Experience supporting database server farms and shared infrastructure environments
Proven ability to analyze application performance trends and create predictive modeling around future risks and bottlenecks
Experience working in hybrid environments (both on prem and Azure)
Strong understanding of application ownership and end to end responsibility, especially when working within a shared infrastructure model