- Location
- Calgary, Alberta, Canada
- Bio
-
Technical Specialist, with growing competencies in Data Science, Machine Learning and Web Development. I've got years of experience in Oil & Gas field Ops, Health & Safety Management Systems, as well as Customer Service Management.
I'm currently transitioning into the Business/IT space with Business Intelligence, Data Analytics and Machine Learning Development capacity. My current career objective is to become an active part of the ongoing Digital Transformation taking place in today's industry, applying real-world solutions to business needs as they arise.
I am an Engineering Technologist by Training (T.T, ASET), with Petroleum Technology and Industrial Chemistry backgrounds. Held and performed Oil and Gas field Specialist roles in Western Canada.
I also have under my belt several years of Client Service, Transaction processing and Bank Branch management experience in a customer-focused and tech-driven retail banking environment.
- Resume
- OvoOkpubulukuDS.pdf
- Categories
- Data analysis Operations Project management Information technology
Skills
Socials
Achievements
Latest feedback
Team feedback
Team feedback
Team feedback
Recent projects
Immersive Visualization for AI Governance
Number of individual students required: 6 Be part of solving the critical problem of AI explainability. Explainability of AI and Machine Learning is a significant challenge, as is humanity's capacity to keep up with & govern ever-improving algorithms. Objective of this project is to apply our patented technology for visualizing & explaining an AI or machine learning model of your choice. For inspiration, we have to-date applied the technology to visualizing nodes of a Tensorflow model and for seeing dimensions used to train a random forest classifier in context of model-determined outcomes vs. actual outcome from the training dataset. Immersion Analytics' patented technology and products make it possible to see up to 18 dimensions of data as a single, intuitive visualization. This is accomplished using special effects as added axes, intensity of each effect is varied at each data point to convey additional dimensions. Our Stepwise Storyboarding capability permits adding dimensions one at a time as you tell the data story. Students will receive a personal-use license for Immersion Analytics Visualizer, which can be used in concert with Python, MATLAB, Qlik Sense, and even CSV data files. We have many samples for use with Python, several for MATLAB, and extensive documentation and install support. Students should be prepared to: Identify an AI or machine learning project where up to 18 dimensions, primarily numeric, are used to train the model. This can be a project from github or elsewhere online, or one of the student's design. Develop immersive visualization using our Python package or MATLAB toolbox alongside our Visualizer product Prepare a 2-page data storytelling walkthrough of your creation, with screenshots (greenshot is fine). While we are open to your inspirations, a few areas of specific interest for us include: Predicting customer lifetime value Recommendation engines Fraud detection
Data Analysis and Machine Learning for HR
Positions Available: 4 We’d like students to help analyze our datasets to identify and evaluate key performance indicators for a standard HR department. This project will create a ready to use HR analytical solution for medium size companies. The solution will use Power BI as a visualization tool and will have a machine learning module used to identify employees at risk of leaving the company. Here are some of the topics we’d like students to explore throughout this project: KPI identification, predictive and prescriptive analysis using Power BI. Involvement in the development of the ML module using Python and Tensorflow. Highlight the benefits of the solution to HR departments in different industries.
Data Scientists: for a Fintech Start-up
Number of positions available: up to 2 (working individually or in a team) Our team is seeking up to two Data Science/ML/AI students who are passionate and creative. We want this (these) students to build our unique streaming data pipelines. This is an incredible opportunity for any student(s) that are looking to build their resume and portfolio. The students will have the chance to lead this project as well as execute on the development. A bit more detail on the project: Background: This (These) student(s) will have the opportunity to develop our specific streaming pipelines and data workflow. Our business and dev team will work closely with the student(s) and help them execute on this project. Students will be using Python and Google Cloud Platform. If you are in your last year of your program and are looking for an incredible opportunity this is it. To accomplish this, we expect the student(s) will: Have experience coding with Python or Java . Be comfortable learning new tools and technologies. Ideally have some experience with GCP products including Dataflow, Apache Beam, BQ, Non-SQL databases and Pub/Sub. Knowledge of GIT is required. If you like the sound of our culture and are ready to tackle this incredible challenge with us, then we'd love to hear from you.
Newsletter Communications Analysis (Virtual)
Number of students: 1 Background: HPCO creates a bi-weekly newsletter for our members and stakeholder using Constant Contact. Project Objective: To evaluate the effectiveness of the newsletter content by analyzing data on "open" rates of each newsletter edition and the "click through" rates of each article in the newsletter. Project Deliverables: Download Constant Contact newsletter data Analyze the "click through" data for each article in the newsletters Summarize the trends (most popular topic, content, design etc) Create a written report of findings Create a PowerPoint presentation of findings
Work experience
Database and Analytics Upskill
Open to New Opportunities
Calgary, Alberta, Canada
February 2021 - Current
Upskilling in:
Database Design & Development with MS SQL Server;
Database Administration,
Predictive Modeling with IBM SPSS
Business Intelligence Analytics
Cloud computing concepts for Analytics and Machine Learning
Tools: Microsoft Excel, Microsoft SQL Server, PostgreSQL, IBM SPSS, Oracle SQL Developer, Microsoft Azure, Google Cloud Platform, AWS.
Technical Intern, Analytics and Data Center Mgt
A4 Systems Corp
Calgary, Alberta, Canada
January 2021 - March 2021
Technical support;
Technology and Industry research & Documentation;
Product analysis, planning & implementation;
Building out & testing of API frontend & User Experience with the Angular Framework
Business Process Modeling using BPMN;
Supporting Project Managers with inventory handling, coding, and organization;
Global Intern, Analytics and Data Center Management
Takenmind Technologies
January 2021 - January 2021
A virtual Collaborative learning effort, backed by United Nations Sustainable Development Goals(SDG);
Working to analyze Business Datasets to find growth and profitability curves;
Exploring Data for insights and Quality Assessments.
PROJECTS
1. Simplilearn Data Science Capstone Project (Retail Customer Segmentation): Exploratory Data Analysis, Cohort Analysis, RFM Predictive modelling using Python, and Dashboard Reporting using Tableau. Link: https://github.com/ovokpus/Customer-Segmentation
2. Global Internship SDG project: Exploratory Analytics and Visualization with Python 3. Delivered proof of concept for Analysis and predictive modeling aimed at reducing Employee Attrition in a Company.
Data Science & Web Development Student
Open to New Opportunities
Calgary, Alberta, Canada
January 2020 - December 2020
Actively involved in Various Data Science and Web Development Self-directed and Instructor-Guided Study Programs, Projects, and Virtual Internships with GE and KPMG.
PROJECTS
1. Car Manufacturing Test. Developed a Prediction model using Python, geared towards reducing time spent in testing various feature combinations for cars. Link: https://github.com/ovokpus/Car-Manufacturing-Test
2. General Electric Digital Technology Analytics Program: Performed Data Engineering on Dataiku DSS to create a single dataset from 8 datasets with the goal of developing a prediction model that will optimize the production process for aircraft parts. Also created a dashboard on Dataiku DSS to present insights on the data.
3. Portfolio Project (Web Development with ComIT): Created a Quiz Game App with Node.js/Express backend functionality. Link: https://github.com/ovokpus/Quiz-app-project
4. Portfolio Project(Data Science): Predicted Employee Salaries based on Job Description using Python 3. Link: https://github.com/ovokpus/Salary-Prediction
5. Consulting Virtual Internship with KPMG: Performed Exploratory Data Analysis and Preprocessing using Excel and Python on Client Dataset. Presented Data Quality report to client and wrote a proposal to Client on creating models that will target prospective buyers.
Key knowledge areas, skills and competencies acquired include:
Data Quality Assessment, Insights and Presentation;
Data Science with R & Python;
Object-Oriented Analysis, Design, & Programming;
Web Development with HTML, CSS, JavaScript, Bootstrap4, Node.js, MongoDB;
ML Regression and Classification Algorithms;
Natural Language Processing (NLP);
SQL; Visualizations with Tableau and Power BI
AWS Machine Learning Foundations; Microsoft Azure Machine Learning Studio; Dataiku DSS, Alteryx Designer..
High-level Overview & practical experience of Big Data & Hadoop Developer tools like MapReduce, Hive, Hue, Apache Sqoop, Apache Flume, Apache Kafka, and Apache Spark.
Education
Industry Certifications in Database Administration & BI Analytics, Techskills Initiative
Bow Valley College
February 2021 - Current
Bootcamp, Data Science Masters Program Bootcamp
Simpliearn
January 2020 - December 2020
Applied Bachelor's, Petroleum Engineering
Southern Alberta Institute of Technology
September 2013 - April 2015
Personal projects
Customer Segmentation
January 2021 - January 2021
In the retail Industry, it is a critical requirement for businesses to understand the value derived from their customers. RFM is a method used for analyzing customer value.
Customer segmentation is the practice of segregating the customer base into groups of individuals based on some common characteristics such as age, gender, interests, and spending habits.
Our objective here is to perform customer segmentation using RFM analysis. The resulting segments can be ordered from most valuable (highest recency, frequency, and value) to least valuable (lowest recency, frequency, and value).
Car Manufacturing Test
August 2020 - November 2020
Since the first automobile, the Benz Patent Motor Car in 1886, Mercedes-Benz has stood for important automotive innovations. These include the passenger safety cell with a crumple zone, the airbag, and intelligent assistance systems. Mercedes-Benz applies for nearly 2000 patents per year, making the brand the European leader among premium carmakers.
Mercedes-Benz is the leader in the premium car industry. With a huge selection of features and options, customers can choose the customized Mercedes-Benz of their dreams.
To ensure the safety and reliability of every unique car configuration before they hit the road, the company’s engineers have developed a robust testing system. As one of the world’s biggest manufacturers of premium cars, safety and efficiency are paramount on Mercedes-Benz’s production lines. However, optimizing the speed of their testing system for many possible feature combinations is complex and time-consuming without a powerful algorithmic approach.
Our objective in this project is to reduce the time that cars spend on the test bench using model prediction.
Salary Prediction Portfolio Project (Python)
May 2020 - August 2020
Our goal in this project is to examine this dataset of job postings, and predict salaries for a new set of postings. This will involve building a model to predict the salaries given in the test dataset.
A practical use of this is for a HR Department of a large company or a Consulting Outfit that needs real-time solutions in order to make effective employment offers to potential hires.
It also finds use in getting to understand current realities in the job market and how businesses can leverage this in order to secure high quality talent, whilst keeping hiring costs low.
The primary tool used for this project is Python 3, along with an extensive array of libraries and packages available for the manipulation of data,and development of predictive modeling algorithms.