Test Efficiency

Test Efficiency measures help researchers determine the best way to use available resources. One way to achieve this efficiency is by using Design of Experiment (DOE) best practices. Best practices outline how to identify test objectives, select outcome measures, and select factors that affect outcomes. Additionally, if testers want to use the results of one test to inform the design of a new test, they can incorporate sequential testing techniques which have slightly different definitions and approaches depending on the scenario. The Test Science team explores available variations of DOE methods and ways to implement them in defense tests.

Research Paper

Determining How Much Testing is Enough: An Exploration of Progress in the Department of Defense Test and Evaluation Community

Research Paper

Statistical Methods for Defense Testing

Design
JEDIS JMP AddIn

JMP Add In

Analysis
Bayesian Binomial Credible Intervals

Interactive Shiny App

Modeling & Simulation

Modeling & Simulation (M&S) is defined as using a representation of a system (a model) in which tests (simulations) can be run to gain information on the real system. Weapon system evaluations are becoming increasingly more reliant on M&S to supplement live testing. In order to have valuable supplements, testers must understand how well these models represent the simulated systems or processes by quantifying uncertainty in the M&S results. The burgeoning research field of Uncertainty Quantification includes concepts such as validation, calibration, and discrepancy modeling. The Test Science Team researches methodologies for applying these concepts to M&S with the goal of improving weapons systems.

Handbook

Handbook on Statistical Design & Analysis Techniques for Modeling & Simulation Validation

Research Paper

Space-Filling Designs for Modeling & Simulation

Research Paper

Metamodeling Techniques for Verification and Validation of Modeling and Simulation Data

Other

An Uncertainty Analysis Case Study of Live Fire Modeling and Simulation

Human-System Interaction

Human-Systems Interaction (HSI) research investigates why and how to improve user engagement with systems that now include artificial intelligence, robotic teammates, and augmented reality. Improvements in technology, engineering, and cyberspace have led to the development of complex, feature-rich, capable systems, which have changed the ways humans interact with the more advanced systems. The cost of this complexity can range from failure to fatalities. The Test Science Team applies HSI across the DoD, including surveying, developing best practices for metric analysis, creating new methods for evaluation, and designing statistical models for assessing human-machine teams (HMT).

Technical Briefing

Characterizing Human-Machine Teaming Metrics for Test & Evaluation

Research Paper

A Multi-Method Approach to Evaluating Human-System Interactions during Operational Testing

Analysis
Validated Scales Repository

Other

Analysis
System Usability Scale (SUS)

Interactive Shiny App

Autonomy & AI Enabled Systems

Autonomy, in this context, is defined as systems that can perform tasks with no external influence. As technology advances, researchers including the DoD are becoming more interested in how machines can perform without the aid of humans. The methodologies that the DoD employs for testing standard systems have a high likelihood of mischaracterizing risk and performance when analyzing this emerging technology. Autonomy research includes concepts such as environment perception, decision making, operation, and ethics. The Test Science Team is working to develop a framework for testing autonomous or AI-enabled systems.

Research Paper

Test & Evaluation of AI-Enabled and Autonomous Systems: A Literature Review

Handbook

Trustworthy Autonomy: A Roadmap to Assurance -- Part 1: System Effectiveness

Research Paper

Initial Validation of the Trust of Automated Systems Test (TOAST)

Technical Briefing

Demystifying the Black Box: A Test Strategy for Autonomy

Subscribe

Our Research

Document Library

Test Efficiency

Request Consult

Modeling & Simulation

Request Consult

Human-System Interaction

Request Consult

Autonomy & AI Enabled Systems

Request Consult