- Assist in implementing a robust, scalable, and high-performance data storage and management system for complex genomic datasets.
- Assist in developing and documenting system architecture, data models, data flow diagrams, and system interfaces.
- Contribute to identifying and addressing performance bottlenecks within the data storage and retrieval pipeline.
- Assist in developing and implementing a scalability plan to accommodate the rapid growth of clinical data.
- Prepare documentation and conduct comprehensive testing activities across the full SDLC, including unit tests, SIT, UAT, load tests, and regression tests.
- Support routine operational activities to maintain overall system efficiency, security, and reliability.
- Perform any other duties assigned by senior officers to ensure successful project delivery.
Skills & experience required:
- Bachelor’s degree in Computer Science, Information Technology, or a related technical discipline.
- Minimum of 4 years of relevant work experience in data management or software engineering.
- Solid experience in data warehousing, data modelling, and ETL pipeline development.
- Hands-on experience with AWS cloud technologies, Linux system administration, and modern data stores (S3, SQL, ClickHouse).
- Familiarity with DevOps tools, specifically containerisation via Docker and Kubernetes.
- Strong coding proficiency in either Java or Python.
- Excellent command of English alongside fluent Cantonese (Mandarin is an advantage).
If you are interested in this role, please click 'Apply Now' or send your CV directly to russell.regalado@randstad.com.hk.