BostonRecruiter Since 2001
the smart solution for Boston jobs

Software Developer -Human Language Technology Data

Company: Softworld Inc
Location: Lexington
Posted on: March 20, 2025

Job Description:

Job Title: Software Developer - Human Language Technology Data

While professional experience and qualifications are key for this role, make sure to check you have the preferable soft skills before applying if required.

Job Location: Lexington, MA 02420

Onsite Requirements:

Speech Research & Design
HLT/ML Learning
Software DevOps

Job Description:

Position Scope and Job Functions:

HLT Data Collection:

Implement Natural Language Processing (NLP) and Machine Learning (ML) tools and techniques to create and enhance data for Human Language Technology (HLT) evaluation and applications.
Use DOMINO Lincoln workflow system for creating interactive foreign language training and testing materials.
Human Language Technology Evaluation:
Ability to identify and apply benchmarks to evaluate AI model performance.
Ability to create custom multi-lingual datasets for the evaluation of machine translation (MT) and automatic speech recognition (ASR) systems.

Generative AI:

Expertise in Large Language Models (LLMs), generative AI to operational systems.
Skills include programming abilities: llama-cpp, mistral, GPT4All, Chat-GPT, Orca, and transformer-based capabilities generally.

For Audio/Speech QA/QC:

Define, create, and implement audio applications for measurement and enhancement of audio and speech recording quality.
Assessing speech corpora integrity.
Coordinating with and providing guidance to subcontractors providing speech corpora.

Advanced Audio Data Analysis:

Ability to design, implement, and confirm the performance of an audio data collection method for the speech intelligibility evaluation of wearable acoustic sensors.

Human Subjects Protocols:

Design, author, and implement study protocols for the collection of multilingual speech and multi-modal databases.
Submit to and maintain protocols with Human Subject Review (HSR) boards and US DoD Human Research Protection agencies.

Manage Data Collection Equipment and Facility:

Maintain and Manage the Group 24 sound room facility.
Specifying equipment needs, coordinating efforts across multiple Groups, creating calibrated acoustic noise simulations.
Implementing Study Protocols for collecting multi-modal data from human subjects.
Author and implement the procedures necessary to provide and preserve the capability to perform in-field speech and acoustic noise data collections and speech communication.

Laboratory Facilities:

Ability to work closely with the Facilities division of the client to design and specify new laboratory spaces.
Ability to interface with the technical team, understand how the laboratory spaces will need to be designed to address technical needs, and communicate the design specifications to the Facilities division.

Required Skills:

HLT Research Experience:

Experience with Java, Python, MATLAB, git, Digital Audio Workstation (DAW) such as Adobe Audition, Audacity, SoX, Sound Exchange, etc.;
Must include experience using machine learning techniques and natural language processing tools to create HLT data sets.
Familiarity with foreign language corpus development is required for this work.
Requires experience designing crowdsourcing jobs for text annotation.
Experience with JSON and SQL Databases.
Experience directing subject matter experts to create interactive foreign language training and testing materials.

Human Subjects Experience:

Authoring Study Protocols and successfully submitting them for approval to client and DoD Human Subject Review Boards for the purpose of multi-sensor data collections and language-learning systems performance.
Demonstrated ability to train new personnel in implementing human subjects data collection protocols is required.

Sound Room Management:

Specify and Maintain equipment.
Data Collection Hardware: MacOS and MS Windows platforms, professional audio interfaces, loudspeaker playback systems, audio microphone and multi-modal sensors (heart rate, skin conductance, etc.) data collection systems.
National Instruments data collection systems; Portable audio recording systems and Sound Pressure Level (SPL) meters.
Demonstrated ability to author and maintain Data Security Plans and Loan Agreements for off-site equipment.
Solid understanding of audio equipment usage is required.

Independence and Reliability:

Demonstrated ability to work independently to complete complex projects on a tight schedule.
Requires strong communication skills, interacting with various client groups, human subjects, and subcontractors.
Demonstrated ability to lead and coordinate teams to produce deliverables on tight deadlines.

HLT/Machine Learning:

Demonstrated experience implementing Machine Learning and in Human Language Technology / Natural Language Processing Tools and Services.

Software Dev-Ops:

Demonstrated ability to work in agile development cycle including issues, projects, pull request review, UI and unit testing, Jenkins build, Artifactory storage, and deployment.

Preferred Skills:

Experience with digital signal processing
Experience in Digital Speech Communication Test and Evaluation
Experience with JSON and SQL Databases
Experience in digital speech communication test and evaluation
Experience in extracting and analyzing data from social media platforms

Skill Matrix:

Qualification Assessment

Must Have:

Data/Reporting

Speech research and development (design, management, and delivery of rigorously specified data): 10 years

Experience

Currently holds a Secret Clearance (OR a higher clearance): Yes

Publishing Research

10 years

Sound Room Management

10 years

Human Factors

Human Language Technologies: 10 years
Human Subjects Research: 10 years

Machine Learning/AI

HLT/Machine Learning: 10 years

Software

Software dev-ops: 10 years

** This client is a US Federal Government contractor and is legally required to hire US Citizens. US Citizens will only be considered for this role.
Due to the nature of the work, a United States Government Clearance is required to be eligible for the position. **

Keywords: Softworld Inc, Boston , Software Developer -Human Language Technology Data, IT / Software / Systems , Lexington, Massachusetts

Click here to apply!

Didn't find what you're looking for? Search again!

I'm looking for
in category
within


Log In or Create An Account

Get the latest Massachusetts jobs by following @recnetMA on Twitter!

Boston RSS job feeds