Data Science & Engineering Lead
Cape Town, Western Cape
Posted 19 May 2020

Job Details

Job Description

Digital is hiring!

We have a vacancy for a Software Engineering Lead at our offices in Cape Town. We are South Africa's largest digital publishing house, comprising a network of popular digital publishing brands and online services.

Role Title: Data Science & Engineering Lead

Job Family: Developer Manager/Software Engineering Lead

Reports to: Head of Core Platforms and Architecture

Direct Reports: Data Engineer, Data Scientists, Implementation Architect

Overview of the role

You will lead and contribute to the development of our major data infrastructure components and data products. You will manage the team that owns the data warehouse and analytics products, including predictive models for various aspects of the business.
You will work primarily within the Google Cloud environment, using a variety of the tools that Google offers, from BigQuery and Dataproc to Kubernetes and AI Hub. You should not be afraid to dive into dirty data and help the team make sense of it. We are in the game of taking data and turning it into amazing stories and pretty pictures that help the decision makers drive the business forward.
We work in an agile environment alongside a young, dynamic, multi-skilled team of developers, data engineers, data scientists, and designers, and we work closely with product owners. The aim is to deliver sound technical solutions based on the needs of the business and users.
Work-life balance is also incredibly important to us, so our fast-paced working environment is engineer-led and project-driven. The environment, tasks, and resources change according to critical projects.
Main purpose of the role:
Effectively lead the Data team in using their analytical, statistical, and programming skills to collect, analyze, and interpret large data sets, and to derive information for developing data-driven solutions to difficult business challenges.
Key responsibilities:
• Use the agile methodology to keep track of team and individual progress.
• Understand the business context and objectives for each data science project by working closely with key stakeholder groups.
• Identify relevant data sources and data sets to mine for client business needs, and collect large structured and unstructured datasets and variables.
• Work as a data strategist, identifying and integrating new datasets that can be leveraged through our product capabilities, and work closely with the engineering team to strategize and execute the development of data products.
• Execute experiments methodically to help solve various problems and make a true impact across various domains and industries.
• Devise and utilize algorithms and models to mine big data stores, perform data and error analysis to improve models, and clean and validate data for uniformity and accuracy.
• Analyze data for trends and patterns, and interpret data with a clear objective in mind.
• Implement models into production by collaborating with software developers and machine learning engineers.
• Communicate solutions to stakeholders and implement improvements to operational systems as needed.
• Manage people and drive project-led work within tight deadlines.

Skills and competencies:
• Excellent understanding of machine learning techniques and algorithms, such as k-NN, Naive Bayes, SVM, Decision Forests, Neural Networks, etc.
• Experience with common data science toolkits, such as PyTorch, TensorFlow, scikit-learn, spaCy, R, Weka, NumPy, MATLAB, etc. Excellence in at least one of these is highly desirable.
• Experience with data visualization tools, such as D3.js, ggplot, Data Studio, etc.
• Proficiency in using query languages such as SQL, Hive, and Pig.
• Experience with NoSQL databases, such as MongoDB, Cassandra, and Bigtable.
• Experience with cloud technology platforms.
• Good applied statistics skills, such as distributions, statistical testing, regression, etc.
• Good scripting and programming skills.
• Experience with open source data science platforms, e.g. Kubeflow, MLflow, Flyte.
• Experience working with and creating data architectures.
• Excellent written and verbal communication skills for coordinating across teams.
• Experience analyzing data from third-party providers such as Google Analytics, SiteCatalyst, Coremetrics, AdWords, Crimson Hexagon, Facebook Insights, etc.
• An understanding of ethics in data science, data privacy and biases in modeling data.
• A drive to learn and master new technologies and techniques.
• Experience with natural language processing and understanding is a plus.
• Any exposure to Digital Marketing is a plus.

Qualification
Required: Any relevant online certificates and certifications, or recognition of related work experience
Preferred: Bachelor's degree in stats, applied math, or a related discipline (any quantitative background)

Experience
Required: 3+ years; proven ability to lead a project or team
Preferred: 5+ years