Data Scientist
Job Details
About the Company
With operational hubs scattered across Europe, Asia, and LATAM, and its headquarters situated in San Francisco, US, the company boasts a workforce of over 1,000 adept professionals. Spanning across more than 20 countries, ALLSTARSIT offers a diverse range of skilled employees across various verticals, including AI, cybersecurity, healthcare, fintech, telecom, media, and so on.
About the Project
We’re looking for a talented Senior Data Scientist to join our Research Team. The Team discovers, develops, and implements our core technology using both established and
novel approaches, including NLP and tabular data processing.
In this role, you will drive the research and development of novel AI and algorithmic approaches, advancing ideas from hypothesis to proof of concept. You will conduct anin-depth exploration of complex datasets, investigate underlying patterns, and rigorously evaluate alternative methods to identify the most effective solutions. Your work will leverage techniques such as NLP, GenAI, and other ML-based techniques, as well as tabular data analysis and other advanced computational methods, to address challenging research problems.
Specialization
Headquarters
Years on the market
Team size and structure
Current technology stack
Required skills:
- M.S. or Ph.D. in Computer Science or a related quantitative field
- 3+ years of industry experience in NLP and/or tabular data processing
- 2+ years of hands-on experience with deep learning methods
- Experience delivering AI-driven projects to production
- Expertise in both classical and generative NLP (including text-to-SQL or generative
tabular data) - Strong Python skills and experience with deep learning frameworks
- Familiarity with cloud-based NLP platforms is a plus
- Excellent English communication skills (written and verbal)
- Strong collaboration, communication, and interpersonal skills
- Highly organized, self-motivated, and able to work independently
Scope of work:
- Help to lead the technical AI roadmap development and assist team members with the practical research and its implementation
- Design, develop, and implement NLP and tabular data algorithms using diverse methods,
e.g., GenAI, RAG, deep learning, classical NLP and ML, NER, and rule-based regex - Work with large datasets, clean and preprocess data, and develop pipelines for data extraction and transformation
- Stay updated on the latest developments in NLP and tabular data analysis, and apply
state-of-the-art techniques to solve specific business problems - Collaborate with cross-functional teams, including software developers, DevOps
engineers, and domain experts, to integrate the research solutions into production