Capturing Training Data using Natural Language Processing
Project scope
Categories
Data analysis Data modelling Machine learning Artificial intelligence Data scienceSkills
data preprocessing chunking development environment natural language processing (nlp) data analysisUbineer is looking to add to our data set by integrating advanced Natural Language Processing (NLP) techniques. Currently we are seeking to build a massive data set that understands complex queries to capture data. The goal of this project is to expand/improve our data sets so that we can train a Large LLM. We are seeking student who want to understand how NLP works and have a passion in data analysis. The project will involve tasks such as data preprocessing, capturing, and storing. By the end of the project, the students would understand key NLP techniques such as chunking.
The deliverables for this project include a 1-2 hour tutorial, weekly stand up meetings and if all goes well, some of the code you generate will land on our production environment.
At the end of the project students will be responsible for creating a 2-3 page document (report) describing what they learned and completed.
The report should have:
- Basic Description of project
- What was completed
- Number of Companies,
- Number of of files parsed
- Number of data points captured
- Number of text segement captured.
- Speed per capture.
Providing specialized, in-depth knowledge and general industry insights for a comprehensive understanding.
Sharing knowledge in specific technical skills, techniques, methodologies required for the project.
Direct involvement in project tasks, offering guidance, and demonstrating techniques.
Providing access to necessary tools, software, and resources required for project completion.
Scheduled check-ins to discuss progress, address challenges, and provide feedback.
Supported causes
Industry, innovation and infrastructureAbout the company
Ubineer is a leading AI financial technology company focused on delivering productivity to financial decision makers. Our goal is to be the primary source for knowledge management, collaboration and insights in the investment industry. Our mission is to help financial decision makers optimize the quality and speed of their investment lifecycle.