Job description
1. Network Architecture and Security Construction
- Lead the network architecture design and optimization for platforms such as Alibaba Cloud International, Akamai, and CloudFlare.
- Responsible for the construction of production network security boundaries, zero-trust production environment, configuration and maintenance of WAF and DDoS protection strategies to ensure business network security.
- Handle complex network failures, perform packet capture analysis and link optimization.
2. System Stability and Deep Operations
- Have a deep understanding of Linux/Unix-like systems, capable of deep tuning at the operating system level, kernel parameter optimization, and troubleshooting.
- Ensure high availability of core business systems, establish a complete monitoring alert and fault emergency response mechanism (SLA assurance).
- Responsible for performance tuning and stability maintenance of middleware such as Nginx, MQ, Redis, and MySQL.
3. DevOps and Tool Platform Development
- Design and implement efficient CI/CD pipelines to improve release efficiency and quality.
- Use Python/Go/Shell to independently develop operation and maintenance automation tools, monitoring system plugins, and operation management platforms.
- Manage cloud resources through Infrastructure as Code (IaC) to enhance resource delivery efficiency and standardization.
Technical Requirements
1. Network and Cloud Platform
- Proficient in TCP/IP protocol stack, HTTP/HTTPS, DNS, BGP, and other underlying network principles.
- In-depth understanding of the architecture and operation of core components of Alibaba Cloud International (ECS, SLB, VPC, CEN, NAT, etc.).
- Proficient in configuring CDN and security products such as Akamai and CloudFlare, with rich experience in anti-attack (CC/DDoS) and WAF rule tuning.
2. System Operations
- Proficient in Linux system management, with strong problem identification and root cause analysis skills (proficient in using tools like strace, tcpdump, perf, etc.).
- Familiar with source-level configuration and tuning of commonly used open-source middleware (Nginx, Redis, MQ, etc.).
3. Development Capability (Architecture and Implementation)
- Proficient in Python or Go (one of the two), with good coding standards and backend architecture design capabilities.
- Experience in actual operation platforms, CMDB, automation frameworks, or secondary development of middleware.
- Familiar with commonly used DevOps toolchains (Jenkins, GitLab CI, Ansible, Terraform, etc.).
Bonus Points
- Experience in complex network operations for internet overseas business (Global Business).
- Experience in financial/Web3 operations.
- Basic maintenance capability of K8s (although current K8s construction is on hold, basic knowledge reserve is required).
- Hold Alibaba Cloud ACP/ACE certification or senior network engineer certification (CCIE/HCIE, etc.).
- Qualifications
- Over 5 years of frontline operation and maintenance, DevOps work experience.
- Comprehensive architecture design and implementation capability in network, system, and development.
- Strong sense of responsibility and ability to work under pressure, capable of independently handling technical challenges and implementing solution designs.