
Principal, Data Scientist
- Dallas, TX
- $112,700-155,000 per year
- Permanent
- Full-time
- Create data derivation and linkage through algorithm and/or data rules.
- Create and select predictive features from raw data.
- Create model reports for stakeholder review and model documentation.
- Perform pattern recognition model creation and training using various types of algorithms and machine learning modeling techniques for unknown or less well-defined – an unknown problem, solve new problems (e.g., clustering, logistic regression, deep learning, support vector machines, boosting trees, supervised and unsupervised models).
- Develop analytics using Python, PySpark and other big data tools such as AutoML, BigQuery(BQ), BQML, DataPrep and Jupyter Notebook.
- Develop analytics solutions using traditional database technology such as SQL and big data technology such as Hadoop, MongoDB, Cassandra, Redis, PostgreSQL and Neo4j, NoSQL databases.
- Perform complex data analysis on large volumes of property and consumer data and present model performance to stakeholders.
- Provide data science and analytic support for the Enterprise Data Solutions Group (EDSG) and interact with other EDSG teams as well as other business units.
- QA and analyze large amounts of data to create appropriate data sets for model building.
- Prepare and maintain programs and documentation for analytic models.
- Conduct defined quantitative and qualitative research projects independently and communicates research results to internal and external stakeholders.
- Act as a consultant to EDSG’s data management and data technology teams.
- Lead and provide technical leadership on data science project with other data scientists, data engineers, and data analysts.
- Master’s or higher in artificial intelligence, data science, computer science, math, statistics or engineering field, or equivalent work experience.
- Ability to thrive in a team environment and adapt to quickly changing priorities.
- 10+ years of directly related experience.
- Strong problem solving and analytical ability.
- Strong communication skills.
- Ability to quickly and efficiently adapt to new concepts.
- Demonstrated knowledge of statistical techniques.
- Data mining/data analysis experience.
- Strong working knowledge in multiple analytic development tools, statistical tools, programming languages, and big data tools.
- Working knowledge in cloud platform (e.g. AWS, GCP), cloud-based sandbox environment and analytics sandbox and libraries.
- Ability to work collaborative with cross-function teams and business units.
- Demonstrated general business acumen; experience working in a real estate and mortgage related data industry a plus.
- Ability to lead small team in technical projects.
- Patents and publications in data science field a plus