Senior Data Scientist
Cyber Cloud Technologies
- Washington DC
- $75,000-190,000 per year
- Permanent
- Full-time
- Primary responsibilities will focus on data science support
- Review model output results according to technical requirements and verify that the algorithm design and implementation meet expectations, provide feedback and recommendations to address identified issues
- Improve efficiency of data verification through process optimization and process automation
- Convert existing modules and queries from legacy platforms such as SAS/STATA to PySpark for cluster-based processing
- Collaborate with other data scientists on data analysis, implementation review, model verification, and performance improvement
- Brainstorm, design, and develop data processing approaches, modules, and functions; prepare test data and configurations
- Identify and evaluate open-source software packages to support modelling and analysis to improve cost-effectiveness
- Work with application engineering and DevOps teams on data science package deployment; assist the teams on issues related to data science
- Setup Spark and customize unit development environments according to technical needs
- Maintain code base, data sets, and the related project documents associated with the data analysis
- Work with shareholders on technical directions, project requirements, and technical updates
- Provide SME-level support on cutting-edge methodologies such as those related to Artificial Intelligence and Machine Learning
- Bachelor's Degree with 10+ years data science; 5 years Python/Pyspark/SAS/STATA
- US Citizenship required. Candidates must also be able to pass a Department of Commerce Public Trust investigation
- Extensive data science knowledge with strong modeling, analysis, and hands-on data background
- Expertise in large-scale data processing, data research, data mining, and verification
- Experience in Agile project methodology in a fast-pace development project environment
- Familiar with information security principles and policies to protect sensitive information
- Knowledge in applying AI/ML to data science projects and produce business results
- Data Engineering experience desired
- AWS database skills, vector DBs and graph DBs like Neptune
- AWS SageMaker