Data Scientist I/ II/ Sr

Houston, TX
Permanent
Full-time

13 days ago

*To access Dayforce at its best, log in from the latest version of Chrome, Safari, Firefox, or Edge. *Boardwalk is a limited partnership operating in the midstream portion of the natural gas and natural gas liquids industry, providing transportation and storage services for our customers. Our 14,000 miles of pipeline and storage assets provide diverse market connectivity to producers and end-users who need reliable sources of natural gas for power generation, home heating or petrochemical feedstocks. We have the experience, knowledge, and flexibility to design service offerings and create system enhancements tailored to our customers’ needs throughout the 13 states in which we operate. As an organization focused on sustainability, we are committed to protecting the environment while delivering this energy source. This commitment is made to our customers, employees, and the communities in which we operate. We incorporate environmental stewardship, safety, and compliance into our day-to-day operations and seek to strengthen and support the communities we serve. Additional information about the company can be found online at .We are currently looking for a Data Scientist I/ II/ Sr for our Houston, TX office.POSITION DESCRIPTION:The Data Scientist plays a key role in advancing digital innovation by developing data-driven models and insights that support operational efficiency, asset optimization, and strategic decision-making. This role will work closely with data engineers, business stakeholders, and digital leadership to design and deploy machine learning (ML) models, predictive analytics, and advanced visualizations that drive measurable business outcomes.The ideal candidate combines strong analytical skills with deep technical expertise in cloud-based data science tools, particularly within the Amazon Web Services (AWS) ecosystem. This role requires a passion for solving complex problems, a collaborative mindset, and the ability to translate data into actionable insights for a midstream energy environment.KEY RESPONSIBILITIESModel Development & Advanced Analytics

Design, build, and deploy predictive models and machine learning (ML) algorithms to support asset performance, reliability, and commercial optimization.
Conduct exploratory data analysis (EDA), feature engineering, and statistical modeling using Python, R, or similar tools.
Develop time series forecasting, anomaly detection, and classification models for operational and business use cases.
Apply geospatial and sensor data analytics to support pipeline monitoring, flow optimization, and risk assessment.

Cloud & Platform Integration

Leverage Amazon Web Services (AWS) tools such as SageMaker, Redshift, Simple Storage Service (S3), Lambda, and Glue for model training, deployment, and data access.
Collaborate with data engineers to ensure models are integrated into production pipelines and dashboards.
Use version control (e.g., Git) and continuous integration/continuous deployment (CI/CD) practices to manage model lifecycle and reproducibility.

Business Collaboration & Impact

Partner with operations, engineering, and commercial teams to identify high-impact use cases and translate business needs into analytical solutions.
Present findings and recommendations through compelling data visualizations and storytelling using tools like Power BI or Tableau.
Support the development of self-service analytics and promote data literacy across the organization.
Document model assumptions, limitations, and performance metrics to ensure transparency and trust.

Model Governance & Continuous Improvement

Monitor model performance and retrain as needed to maintain accuracy and relevance.
Implement MLOps practices to support scalable, automated model deployment and monitoring.
Ensure compliance with data governance, privacy, and ethical AI standards.
Stay current with industry trends and emerging technologies to continuously improve analytical capabilities.

REQUIRED SKILLS, KNOWLEDGE, AND EXPERIENCE:

Bachelor’s or Master’s degree in Data Science, Computer Science, Engineering, Statistics, or a related field.
3–5+ years of experience in applied data science, preferably in the energy, utilities, or industrial sectors.
Proficiency in Python, SQL, and data science libraries such as pandas, scikit-learn, TensorFlow, or PyTorch.
Experience working with AWS services including SageMaker, Redshift, S3, and Lambda.
Strong understanding of statistical modeling, machine learning, and data visualization techniques.
Ability to communicate complex technical concepts to non-technical stakeholders.
Experience working with large datasets and real-time or near-real-time data environments.

PREFERRED SKILLS, KNOWLEDGE, AND EXPERIENCE:

Experience in the natural gas midstream or broader oil & gas industry.
Familiarity with MLOps practices and tools for model monitoring and retraining.
Exposure to geospatial data, sensor data, or SCADA systems.
Experience with Power BI, Tableau, or similar BI tools.
Knowledge of data governance, security, and compliance in cloud environments.

REQUIRED EDUCATION:

Bachelor’s degree in Computer Science, Engineering, Statistics, or related field

PREFERRED EDUCATION:

Master’s degree

ADDITIONAL INFORMATION:Boardwalk Pipelines, LP, maintains a drug-free workplace and will require pre-employment drug & substance abuse testing before hire.Boardwalk Pipelines, LP is an equal opportunity employer. All applicants will be considered for employment regardless of race, color, religion, age, sex, gender identity, national origin, veteran, or disability status.

Boardwalk Pipelines

Apply Now