Site Name: USA - California - San Francisco, Cambridge 300 Technology Square, London The Stanley Building, Seattle Sixth Ave, Upper Providence
Posted Date: Nov 20 2023

At GSK, we want to supercharge our data capability to better understand our patients and accelerate our ability to discover vaccines and medicines. The Onyx Research Data Platform organization represents a major investment by GSK R&D and Digital & Tech, designed to deliver a step-change in our ability to leverage data, knowledge, and prediction to find new medicines.

We are a full-stack shop consisting of product and portfolio leadership, data engineering, infrastructure and DevOps, data / metadata / knowledge platforms, and AI/ML and analysis platforms, all geared toward:
- Building a next-generation, metadata- and automation-driven data experience for GSK's scientists, engineers, and decision-makers, increasing productivity and reducing time spent on "data mechanics."
- Providing best-in-class AI/ML and data analysis environments to accelerate our predictive capabilities and attract top-tier talent.
- Aggressively engineering our data at scale, as one unified asset, to unlock the value of our unique collection of data and predictions in real-time.
- Automation of end-to-end data flows: Faster and reliable ingestion of high throughput data in genetics, genomics and multi-omics, to extract value of investments in new technology (instrument to analysis-ready data in
- Enabling governance by design of external and internal data: with engineered practical solutions for controlled use and monitoring
- Innovative disease-specific and domain-expert specific data products: to enable computational scientists and their research unit collaborators to get faster to key insights leading to faster biopharmaceutical development cycles.
- Supporting e2e code traceability and data provenance: Increasing assurance of data integrity through automation, integration
- Improving engineering efficiency: Extensible, reusable, scalable, updateable, maintainable, virtualized traceable data and code would be driven by data engineering innovation and better resource utilization.

We are looking for a skilled and experienced Data Platform Engineer II to join our growing team. Data Platform Engineers take full ownership of delivering high-performing, high-impact data platform as products, and services, from a description of a problem customer Data Engineers are trying to solve all the way through to final delivery (and ongoing monitoring and operations). They are standard bearers for software engineering and quality coding practices within the team and are expected to mentor more junior engineers; they may even coordinate the work of more junior engineers on a large project. They devise useful metrics ensuring their services are meeting customer demand, having an impact, and iterate to deliver and improve on those metrics in an agile fashion.

The Data Platform team builds and manages reusable components and architectures designed to make it both fast and easy to build robust, scalable, production-grade data products and services in the challenging biomedical data space.

A Data Platform Engineer II is a technical individual contributor, building modern, cloud-native systems for standardizing and templatizing data engineering, such as:
- Standardized physical storage and search / indexing systems
- Schema management (data + metadata + versioning + provenance + governance)
- API semantics and ontology management
- Standard API architectures
- Kafka + standard streaming semantics
- Standard components for publishing data to file-based, relational, and other sorts of data stores
- Metadata systems
- Tooling for QA / evaluation
etc.

A Data Platform Engineer II knows the metrics desired for their tools and services and iterates to deliver and improv