|
岗位职责
1. 云平台管理与运维:
- 负责公司 AWS 云平台(VPC, EC2, ECS CLUSTER, S3, IAM, RDS 等)的日常管理、监控、维护、故障排查和性能优化;
- 保障公司线上系统(7x24小时)的稳定、安全与高效运行;
2. 自动化与CI/CD:
- 设计和维护基于 Jenkins/GitLab CI 的持续集成和持续部署 (CI/CD) 流水线,实现应用的自动化部署和发布;
3. 监控与告警:
- 搭建和维护基于 Prometheus/Grafana 或 AWS CloudWatch 的监控告警体系,确保能快速发现并响应问题;
4. 安全与合规:
- 实施云安全最佳实践,管理网络访问控制(Security Group/NACL),定期进行安全审计和漏洞扫描;
- 管理 IAM 权限,遵循最小权限原则;
5. 成本优化:
- 监控和分析 AWS 资源使用情况,提出并实施成本优化方案(如使用 Reserved Instances, Spot Instances,优化存储等);
6. 技术支持:
- 为开发团队提供云资源和技术支持,协助解决应用程序部署和运行中的问题;
7. 其他:
- 收集各部门对业务系统改进的需求并分析,协调开发人员进行系统改进,协助测试及实施;
- 撰写业务系统相关技术文档和流程手册,主持业务系统使用相关培训,及时解答各部门在系统使用过程中的疑问和咨询;
- 管理系统账号及权限;
- 协调及配合供应商解决ERP运行过程中的突发状况和问题,保证系统的正常使用;
- 负责IT运维相关流程的规划、设计、推行、实施和持续改进;
1. Cloud Platform Management and Operations:
· Manage, monitor, maintain, troubleshoot, and optimize the company’s AWS cloud platform (VPC, EC2, ECS Cluster, S3, IAM, RDS, etc.);
· Ensure stable, secure, and efficient operation of the company’s online systems (24/7);
2. Automation and CI/CD:
· Design and maintain Jenkins/GitLab CI-based continuous integration and continuous deployment (CI/CD) pipelines to achieve automated application deployment and release;
3. Monitoring and Alerting:
· Establish and maintain a monitoring and alerting system based on Prometheus/Grafana or AWS CloudWatch to ensure rapid issue detection and response;
4. Security and Compliance:
· Implement cloud security best practices, manage network access controls (Security Groups/NACLs), and conduct regular security audits and vulnerability scans;
· Manage IAM permissions following the principle of least privilege;
5. Cost Optimization:
· Monitor and analyze AWS resource usage, propose and implement cost optimization strategies (e.g., Reserved Instances, Spot Instances, storage optimization, etc.);
6. Technical Support:
· Provide cloud resource and technical support to development teams, assisting in resolving application deployment and operational issues;
7. Other Duties:
· Gather and analyze business system improvement requirements from various departments, coordinate with developers for system enhancements, and assist in testing and implementation;
· Write technical documentation and process manuals related to business systems, conduct training sessions on system usage, and promptly address inquiries from departments;
· Manage system accounts and permissions;
· Coordinate with vendors to resolve emergencies and issues during ERP operation to ensure normal system functionality;
· Plan, design, implement, and continuously improve IT operations-related processes;
任职资格
1. 必备条件:
- 3年以上 AWS 云平台管理和运维经验,持有 AWS Solutions Architect Associate 或 SysOps Administrator Associate 等相关认证者优先;
- 精通 Linux/Ubuntu 操作系统,具备扎实的 Shell 脚本编写能力;
- 熟悉至少一种配置管理工具,如 Ansible;
- 熟练使用 Docker 进行应用容器化,有 Kubernetes (EKS) 经验者优先;
- 具备良好的故障排查能力,能快速定位并解决网络、系统及应用层面的问题;
- 具备2年以上的ERP、OA等系统实施、维护经验者优先;
- 强烈的责任心和团队协作精神,具备优秀的问题分析和解决能力;
2. 技术栈匹配(有相关经验者优先):
- 网络与中间件:熟悉 APISIX, Nacos, Kafka, Redis 等中间件的部署、配置和调优;
- 数据库:对 MySQL 和 ClickHouse 有基本的运维和排障能力(备份、恢复、性能查看);
- 存储:具有 AWS S3 的实战管理经验,包括生命周期策略、权限管理等;
- 语言:了解 Java/Go 应用的基本部署模式和特点,能与开发团队顺畅沟通;
1. Essential Requirements:
· 3+ years of experience in AWS cloud platform management and operations; holders of AWS Solutions Architect Associate or SysOps Administrator Associate certifications are preferred;
· Proficiency in Linux/Ubuntu operating systems with solid Shell scripting skills;
· Familiarity with at least one configuration management tool, such as Ansible;
· Experience in application containerization using Docker; knowledge of Kubernetes (EKS) is a plus;
· Strong troubleshooting skills with the ability to quickly identify and resolve network, system, and application-level issues;
· 2+ years of experience in implementing and maintaining ERP, OA, or similar systems is preferred;
· Strong sense of responsibility, teamwork spirit, and excellent problem-analysis and resolution skills;
2. Technical Stack (Preferred Experience):
· Networking and Middleware: Familiarity with deployment, configuration, and tuning of middleware such as APISIX, Nacos, Kafka, Redis, etc.;
· Databases: Basic operational and troubleshooting skills for MySQL and ClickHouse (backup, recovery, performance monitoring);
· Storage: Hands-on experience managing AWS S3, including lifecycle policies and permission management;
· Programming Languages: Understanding of basic deployment patterns and characteristics of Java/Go applications, with the ability to communicate effectively with development teams.
薪资面议。
工作时间安排:标准工时: 每周一至周四,10:00 至 18:30; 工作地点: 马德里-Villaverde 周五提前下班: 工作时间为 10:00 至 17:30; 休息安排: 实行双休制(周末休息),并遵循国家法定节假日放假安排; 额外带薪假期: 每年年底享有 12月24日、26日及31日 三天额外带薪假期。
应聘方式:
蒋小姐: 624129781
我们将尽快与符合条件的候选人联系并安排面试。
|
|