Skopje, Bitola

Site Reliability Engineer (Lead/Senior)

Valtech is a global company focused on business transformation powered by digital innovation. With more than 5,000 doers and makers working out of our 50+ offices in 18 countries, Valtech has established itself as one of the largest independent global groups aiming at creating digital experiences that improve human lives while transforming the future of our clients’ businesses. We work with some of the world’s best-known brands from across the retail, manufacturing, distribution, mobility, travel, health, public and finance sectors. And our promise to the market is that we get things done. Together. We transform by doing!

Over the years we have created an energising workplace, where we operate under the guiding principles of craftsmanship, autonomy, trust, inclusiveness, openness and accountability. Within our industry there are many agencies that offer the same tools and methodologies that we do. The secret to our continued success is our people.

Our growth ambitions and our move into the space of the elite global digital agencies won’t be driven by discovering a new platform or system that no one else has, or by mastering all sorts of services. They will be driven by having the best, most knowledgeable, most committed, most passionate and most trustworthy people on board.


Lead Site Reliability Engineer

  • A proven background in Reliability, covering Observability, Reliability and Performance.
  • Commercial experience of observability, from data ingestion to alerting and issue resolution. Elastic highly desirable, but AppDynamics, Datadog etc. considered.
  • Comfortable with logging frameworks, log manipulation and shipping.
  • Capable of designing, testing and implementing resilience on new and existing AWS solutions.
  • Performance testing a bonus but not a must have.
  • Comfortable coding in one or more languages such as Python, Go, Java, NodeJS etc.
  • Commercial experience of using AWS Services including EC2, ECS, Serverless (Lambda);
  • You’ll know how to debug complex, high-availability production environments;
  • Networking knowledge, load balancing, TCP/HTTP etc.;
  • Comfortable with Linux operating systems and able to create and maintain shell scripts
  • Demonstrable range of in-depth technical knowledge/experience of handling complex software and platform architectures;
  • In-depth level of technical knowledge/experience in building cloud solutions that have security, reliability, scalability, high availability and concurrency built-in from the outset.
  • Background and relevant current experience in a hands-on Observability/SRE/Platform Engineering role is needed;
  • Knowledge of IaaS deployment tools such as Terraform;
  • Competent in using source control, preferably Git based.

Optional:

  • Elastic Observability or OpenTelemetry experience
  • Working knowledge of continuous integration systems such as Jenkins and GitLab;
  • Elasticsearch internals experience a big plus.
  • Performance Test experience.
  • AWS Certification.
  • Docker development experience is desirable.

The Role:

As a Lead Site Reliability Engineer you’ll be expected to:

  • Develop and maintain Observability solutions using Elastic Observability and OpenTelemetry.
  • Design, test and implement resilience on new and existing AWS solutions.
  • Assist tribes with performance testing.
  • Build and maintain solutions developed on AWS.
  • Help the tribes enable observability features and develop solutions where none currently exist. Also document the process for future reference.
  • Assist with the creation and maintenance of pipelines to manage the observability components;
  • Monitor and report usage of our cloud solutions;
  • Advise on the selection of the most appropriate technologies for the task;
  • Ensure the delivery pipeline for your IaaS code has optimal quality controls built-in to support testing, deployment, reporting and task management;
  • Select appropriate quality controls to complete assigned tasks, including: code-driven deployment; infrastructure deployment; automated testing; and effective operational monitoring, alerting and incident response;
  • Supply appropriate information and analysis to support resolution of issues and incidents with the tribes’ Observability.


Senior Site Reliability Engineer

  • A background in Reliability, covering Observability and Reliability; with Performance as a nice to have.
  • Experience of observability, from data ingestion to alerting and issue resolution using any Observability tools.
  • Able to work with logging frameworks, log manipulation and shipping.
  • Capable of testing and implementing resilience on new and existing AWS solutions.
  • Performance testing a bonus but not a must have.
  • Comfortable coding in one or more languages such as Python, Go, Java, NodeJS etc.
  • Experience of using Cloud based services preferred, AWS a bonus.
  • You’ll know the basics of networking, such as IP addressing, firewalls, load balancers, HTTP etc.
  • Comfortable with Linux operating systems and able to create and maintain basic shell scripts
  • Technical knowledge/experience in building cloud solutions that have security, reliability, scalability, high availability and concurrency built-in from the outset.
  • Some knowledge of IaaS deployment tools such as Terraform is a bonus.
  • Competent in using source control, preferably Git based.
  • Background and relevant current experience in a hands-on Observability/SRE/Platform Engineering role is preferred.

Optional:

  • Elastic Observability or OpenTelemetry experience
  • Working knowledge of continuous integration systems such as Jenkins and GitLab;
  • Elasticsearch internals experience a big plus.
  • Performance Test experience.
  • AWS Certification.


The Role:

As a Senior Site Reliability Engineer you’ll be expected to:

  • Develop and maintain Observability solutions using Elastic Observability and OpenTelemetry.
  • Test and implement resilience on new and existing AWS solutions.
  • Assist tribes with performance testing (nice to have).
  • Build and maintain solutions developed on AWS.
  • Help the tribes enable observability features. Also document the process for future reference.
  • Assist with the creation and maintenance of pipelines to manage the observability components;
  • Monitor and report usage of our cloud solutions;
  • Help with developing the Observability pipelines.
  • Develop quality controls to complete assigned tasks, including: code-driven deployment; infrastructure deployment; automated testing; and effective operational monitoring, alerting and incident response.
  • Supply appropriate information and analysis to support resolution of issues and incidents with the tribes’ Observability.


As a consultant and as a link between developers and our clients, you are expected to develop expertise both in technology and in communicating complex concepts and rationale to non-techies. We’ll encourage and support this with frequent opportunities to share ideas internally. We also have consultants who frequently present at regional, national and global conferences.

What do we offer in return?

Private Health Insurance

We hope you will never need it, but nevertheless, we offer private health insurance to all our employees.

Education Program

We never stop learning. That’s why we offer our employees an educational program with training and certification.

Wellbeing Program

We all deserve to live a healthy and well-balanced life. It's not an option, it's a necessity!

One simple rule

We really want you to enjoy your work, so we set one simple rule: none of our teammates will work on the same project in the same industry for a very long time, and everyone will always follow the latest technology standards. You can never get bored working at Valtech!

Work from home

Our jobs wrap around our lives, not the other way around. Wherever you feel most comfortable and productive, we make sure to respect your choice.

Social Events

We enjoy spending time together, not only at work. Ski trips, karting, laser-tag, wine tasting, picnics, cooking classes… you name it – we’ve done it! There are plenty of cool events to join and to get to know your colleagues.

Competitive Conditions

Besides a very competitive salary and additional vacation days, you will join annual company events with the whole team.

Challenging Projects

Ready for a challenge? We guarantee you’ll find challenging projects at Valtech!

Cool Colleagues

What’s the most important thing in a job? Cool colleagues with whom you spend most of your time during the week. We have a lot of them!

Honest Feedback

Honesty, openness and respect are among our core values. We encourage an open feedback culture in order to build trust and grow together.

Company values

We SHARE our knowledge with our clients and colleagues all over the world. We value different opinions and embrace open discussions. We DARE to go into unknown territories. We dare to speak up and be totally honest. We CARE about the end user experience, about our clients' businesses and about the quality of the things we make. We want to make the world a better place through the work we do.

Modern Office

Our office, located in the centre of Skopje, was fully renovated and furnished with the best and most comfy furniture out there.

Free Beverages

Enjoy free coffee, drinks, and snacks at work, or join one of our famous company dinners.

24 vacation days

Bonuses…

Get in touch

Let's reinvent the future