Job Details
Complete information about the position and its requirements
Yukerja Summary
We curated this SRE Engineer opening at PT. Dollar Information Consultan Indonesia from JobStreet (Technology & IT category). Check the work location (Jakarta) before applying. Yukerja.com is not the employer; applications are processed on the official source site.
Job Description
1. Experience operating large-scale GPU cluster data centers, such as those with thousands or tens of thousands of NVIDIA GPUs.
2. Ability to identify and resolve common issues in AI infrastructure computing centers—including GPU, storage, and network-related problems—with sound technical judgment.
3. Demonstrated ability to track known issues end-to-end, improve problem-resolution efficiency, and standardize incident-handling processes.
4. Experience ensuring data center stability through proactive measures, such as configuring monitoring dashboards and performing daily inspections.
5. Experience deploying and implementing common monitoring tools, including Prometheus and Grafana.
Job Requirements
1. Bachelor’s degree or higher in computer science or a related field, with at least three years of experience in data center operations and maintenance.
2. Proficiency in core network protocols, including BGP, OSPF, IS-IS, VXLAN, and EVPN.
3. In-depth understanding of, and hands-on experience with, high-performance AI computing networks such as InfiniBand, RoCEv2, and lossless Ethernet.
4. Preferred: Experience with SDN controllers (e.g., ONOS or OpenDaylight) or with P4 data-plane programming.
5. Fluent in English (CET-6 level or equivalent); the ability to use English as the working language is strongly preferred.