Senior Data Engineer
Company: Castlight Health
Location: San Francisco
Posted on: June 26, 2022
Engineering San Francisco, California Salt Lake City, Utah
At Castlight, our mission is to empower people to make the best
choices for their health and to help companies make the most of
their health benefits. We offer a health benefits platform that
engages employees to make better healthcare decisions and can guide
them to the right program, care, and provider.
The platform also enables benefit leaders to communicate and
measure their programs while driving employee engagement with
targeted, relevant communications.
To date, Castlight has partnered with more than 240 large
enterprise customers, spanning millions of lives, to improve
healthcare outcomes, lower costs, and increase benefits
WHAT YOU'LL BE DOING:
Castlight's Data Engineering Team is responsible for the end-to-end
management of data lifecycle for both batch & streaming data - from
data acquisition (inbound), ingestion & validation, warehousing
(reporting and analytics) and data distribution (outbound).
We are looking for seasoned data engineers who have a penchant for
problem-solving with a passion for data processing innovations,
supportability & data quality to join our Data Engineering Team.
The candidate must be comfortable dealing with large sets of data,
disparate data sources, imperfect data sets and can drive data
automation that will remove frictions in data processing.
As data volume increases, the candidate must apply deep knowledge
in data modeling, query performance tuning, scalability and
optimization to implement best practices and create solutions.
To succeed in this role, you'll need to be a versatile engineer.
You must have an innate desire to build things the right way. You
should be pain-averse -- if some process or system isn't as
streamlined as it could be, you want to fix it! You will have a lot
of interaction with the Customer Success team and other externally
facing groups hence verbal and written communications skills are
required. You must be a champion of standardizing and maturing the
business process to support scalability.
You will be working with cross-functional teams within and outside
of the engineering department and with team members in SF and
India. This is a full-time position at our San Francisco
Data Discovery - analyzing data values and data patterns to
identify the relationships that link disparate data elements into
logical units of information, or "business objects" (such as
customer, patient or claim). Identify the transformation rules that
have been applied to a source system to populate a target such as
transactional entities, operational data store or data
Data Architecture and Modeling - Create logical and physical data
models, including conceptual models. Define data attributes,
including domain constraints and privacy attributes. Discover,
explore, and visualize the structure of data sources. Discover or
identify relationships between disparate data sources. Compare and
synchronize the structure of two data sources
Data Governance/Stewardship - Record the business use for defined
data. Identify opportunities to share and re-use data. Monitors the
progress towards, and tuning of, data quality and data security
target metrics. Ensures the quality, completeness, and accuracy of
data definitions. Identifies and manages the resolution of data
quality and data security issues, such as uniqueness, integrity,
accuracy, consistency, privacy and completeness in a cost-effective
and timely fashion. Identify procedures for disaster recovery and
data archiving to ensure effective protection and integrity of data
Data quality - Ensure the stability, integrity and efficiency of
data access and data quality across the organization via ongoing
database support and maintenance.
Database architecture, administration and development - Work with
application development staff to develop database architectures,
coding standards, and quality assurance policies and procedures.
Participate in testing and implementing database design and
functionality and tuning for performance.
Metadata - support information governance by providing reporting
and traceability on data movement, modeling and business
intelligence applications, as required by regulatory requirements.
Analyze and view the impact of changes to the current information
model, avoiding potentially disruptive modifications to existing
Customer Implementation and Production Support - Assist Customer
Implementation and Production support teams as DM SME during
Data warehouse - Knowledge in developing queries against Enterprise
Data Warehouse star schema is a nice to have. Participate in
research and development of recommendations regarding database
components, including hardware, database systems, ETL software,
metadata management tools and database design solutions.
Minimum 2 years experience building data pipelines with Google
Cloud Platform. Working knowledge of Cloud Data Flow, Cloud Data
Fusion, Cloud Dataproc, Pubsub.
Minimum of 5 years of experience with Python/Java.
Minimum of 5 years of experience writing SQL for querying
databases. Ability to extract information from databases using
complex query statements and advanced database tools.
Minimum of 5 years of experience analyzing and developing data
requirements and data specifications; hands-on experience
documenting understanding and analysis of databases and data
Minimum of 5 years of experience developing backend data sources
for the reporting and analytics platform.
Minimum of 5 years of technical experience with designing,
building, installing, configuring and supporting database servers
including database tuning and troubleshooting experience.
Experience with Informatica.
Experience with version control tools & release management.
BS in Computer Science or related Degree, or equivalent work
You have been redirected to a Castlight Health job page
Keywords: Castlight Health, San Francisco , Senior Data Engineer, Engineering , San Francisco, California
Didn't find what you're looking for? Search again!