Software Developer -Human Language Technology Data
Company: Softworld Inc
Location: Lexington
Posted on: March 20, 2025
|
|
Job Description:
Job Title: Software Developer - Human Language Technology
Data
While professional experience and qualifications are key for this
role, make sure to check you have the preferable soft skills before
applying if required.
Job Location: Lexington, MA 02420
Onsite Requirements:
Speech Research & Design
HLT/ML Learning
Software DevOps
Job Description:
Position Scope and Job Functions:
HLT Data Collection:
Implement Natural Language Processing (NLP) and Machine Learning
(ML) tools and techniques to create and enhance data for Human
Language Technology (HLT) evaluation and applications.
Use DOMINO Lincoln workflow system for creating interactive foreign
language training and testing materials.
Human Language Technology Evaluation:
Ability to identify and apply benchmarks to evaluate AI model
performance.
Ability to create custom multi-lingual datasets for the evaluation
of machine translation (MT) and automatic speech recognition (ASR)
systems.
Generative AI:
Expertise in Large Language Models (LLMs), generative AI to
operational systems.
Skills include programming abilities: llama-cpp, mistral, GPT4All,
Chat-GPT, Orca, and transformer-based capabilities generally.
For Audio/Speech QA/QC:
Define, create, and implement audio applications for measurement
and enhancement of audio and speech recording quality.
Assessing speech corpora integrity.
Coordinating with and providing guidance to subcontractors
providing speech corpora.
Advanced Audio Data Analysis:
Ability to design, implement, and confirm the performance of an
audio data collection method for the speech intelligibility
evaluation of wearable acoustic sensors.
Human Subjects Protocols:
Design, author, and implement study protocols for the collection of
multilingual speech and multi-modal databases.
Submit to and maintain protocols with Human Subject Review (HSR)
boards and US DoD Human Research Protection agencies.
Manage Data Collection Equipment and Facility:
Maintain and Manage the Group 24 sound room facility.
Specifying equipment needs, coordinating efforts across multiple
Groups, creating calibrated acoustic noise simulations.
Implementing Study Protocols for collecting multi-modal data from
human subjects.
Author and implement the procedures necessary to provide and
preserve the capability to perform in-field speech and acoustic
noise data collections and speech communication.
Laboratory Facilities:
Ability to work closely with the Facilities division of the client
to design and specify new laboratory spaces.
Ability to interface with the technical team, understand how the
laboratory spaces will need to be designed to address technical
needs, and communicate the design specifications to the Facilities
division.
Required Skills:
HLT Research Experience:
Experience with Java, Python, MATLAB, git, Digital Audio
Workstation (DAW) such as Adobe Audition, Audacity, SoX, Sound
Exchange, etc.;
Must include experience using machine learning techniques and
natural language processing tools to create HLT data sets.
Familiarity with foreign language corpus development is required
for this work.
Requires experience designing crowdsourcing jobs for text
annotation.
Experience with JSON and SQL Databases.
Experience directing subject matter experts to create interactive
foreign language training and testing materials.
Human Subjects Experience:
Authoring Study Protocols and successfully submitting them for
approval to client and DoD Human Subject Review Boards for the
purpose of multi-sensor data collections and language-learning
systems performance.
Demonstrated ability to train new personnel in implementing human
subjects data collection protocols is required.
Sound Room Management:
Specify and Maintain equipment.
Data Collection Hardware: MacOS and MS Windows platforms,
professional audio interfaces, loudspeaker playback systems, audio
microphone and multi-modal sensors (heart rate, skin conductance,
etc.) data collection systems.
National Instruments data collection systems; Portable audio
recording systems and Sound Pressure Level (SPL) meters.
Demonstrated ability to author and maintain Data Security Plans and
Loan Agreements for off-site equipment.
Solid understanding of audio equipment usage is required.
Independence and Reliability:
Demonstrated ability to work independently to complete complex
projects on a tight schedule.
Requires strong communication skills, interacting with various
client groups, human subjects, and subcontractors.
Demonstrated ability to lead and coordinate teams to produce
deliverables on tight deadlines.
HLT/Machine Learning:
Demonstrated experience implementing Machine Learning and in Human
Language Technology / Natural Language Processing Tools and
Services.
Software Dev-Ops:
Demonstrated ability to work in agile development cycle including
issues, projects, pull request review, UI and unit testing, Jenkins
build, Artifactory storage, and deployment.
Preferred Skills:
Experience with digital signal processing
Experience in Digital Speech Communication Test and Evaluation
Experience with JSON and SQL Databases
Experience in digital speech communication test and evaluation
Experience in extracting and analyzing data from social media
platforms
Skill Matrix:
Qualification Assessment
Must Have:
Data/Reporting
Speech research and development (design, management, and delivery
of rigorously specified data): 10 years
Experience
Currently holds a Secret Clearance (OR a higher clearance): Yes
Publishing Research
10 years
Sound Room Management
10 years
Human Factors
Human Language Technologies: 10 years
Human Subjects Research: 10 years
Machine Learning/AI
HLT/Machine Learning: 10 years
Software
Software dev-ops: 10 years
** This client is a US Federal Government contractor and is legally
required to hire US Citizens. US Citizens will only be considered
for this role.
Due to the nature of the work, a United States Government Clearance
is required to be eligible for the position. **
Keywords: Softworld Inc, Boston , Software Developer -Human Language Technology Data, IT / Software / Systems , Lexington, Massachusetts
Click
here to apply!
|