Our systems and applications are highly available and perform exceptionally well. Our business touches everyone's pocket whenever a card is used for a transaction. Those transactions complete within a fraction of a second, and thousands of them happen concurrently every second. Our service is not limited to one location; we operate in 200+ countries.
What are the responsibilities of a Product / Site Reliability Engineer?
- The primary responsibility of a Site Reliability Engineer is to ensure that the environment is secure and safe. All security findings should be remediated within the required resolution date defined by governance.
- We do not allow outages, even for a second. If an issue arises, we, as the owners of the environment, take the necessary steps to keep it up and running. Root cause analysis should be completed within hours. Findings are remediated in the Production environment only after all tests and checks have passed in lower environments.
- As the owner of the environment, we keep track of all activities planned or happening in our environments. We are responsible for deploying new code in the environment.
- We regularly monitor and analyze our environment. If there is a manual task, we automate it. We are increasing self-healing capabilities and will continue to do so until the environments heal themselves automatically.
- When a new service comes under our support, or an old environment is being migrated to new technologies, we start working with product developers to plan for production.
- As our business operates around the clock, we work in shifts and synchronize with multiple locations and multiple tracks (sub-teams).
- We ensure that every activity is recorded according to the incident or change management process. Technical and related runbooks need to be prepared and shared with the team.
Basic Qualifications:-
- Bachelor's degree with 1-2 years of relevant work experience
Preferred Qualifications:-
- B.E/B.Tech in IT or Computer S