3. Guidance3.1 Evaluation Of Software TestingEvaluation of software testing is a complicated activity and the evaluations is further impacted by the amount of data produced by the software test. All software test data should be evaluated and compared to the expected results for the test. The evaluation process and evaluation tools used should be recorded for future assessments. If a software test evaluation tool has an error, the software test data may be misinterpreted by the evaluation team. The process for software test evaluations should be repeatable. Repeatable tests are a key element of evaluating the test results. The context of the test (including tools, compilers, code under test, procedures, …) all need to be recorded to provide context in the evaluation and limitations. Software testing is required regardless of project size. The level of rigor on the tests can be determined by the risk posture of the project. Safety Critical code needs to be tested and is a priority. Areas of higher risk or determined to be critical to success should take next priority. Unit, coverage, and other tests can be used for higher level tests but should be done with caution. While test articles (models, simulators, …) may or may not be used, testing on the actual “flight” hardware may damage equipment and should be considered (see SWE-073 - Platform or Hi-Fidelity Simulations). In all cases, maintaining records and the ability to repeat the tests in the same configuration is required to prove that issues found are resolved. Per NASA-GB-8719.13, NASA Software Safety Guidebook , the analysis methodology for software and system test results includes the following steps: - Verify that software and system test data meet the requirements for verifying all functional software safety requirements and safety-critical software elements.
- Verify via test coverage analysis that all safety requirements, functions, controls, and processes have been completely covered within the unit, component, system, and acceptance level tests.
- Verify that all software safety requirements have been tested, or evaluated, inspected, or demonstrated.
- Verify that all software safety functions are correctly performed and that the software system does not perform unintended functions.
- Verify that all safety requirements have been satisfied.
- Verify that all identified hazards have been eliminated or controlled to an acceptable level of risk.
For Class A software take care to analyze all software affecting safety-critical software and hazardous functionality including HR-33 - Inadvertent Operator Action, 3.2 Developing A Test Results Analysis MethodologyThe following from IEEE-STD-1012-2004, IEEE Standard for Software Verification and Validation, are also appropriate considerations when developing a test results analysis methodology: - Validate that software correctly implements the design.
- Validate that the test results trace to test criteria established by the test traceability in the test planning documents.
- Validate that the software satisfies the test acceptance criteria.
- Verify that the software components are integrated correctly.
- Validate that the software satisfies the system requirements.
Other elements for the evaluation methodology include: - Verify that the test results cover the requirements.
- Determine if actual results match expected results.
- Verify adequacy and completeness of test coverage.
- Determine the appropriateness of test standards and methods used.
3.3 Items Used To Generate And Collect The ResultsFor all levels of software testing (unit, component, integration, etc.) capture and record items used to generate and collect the results. These items are an important part of analyzing the test results since some anomalies could have been caused by the tests themselves. The following are captured, not only for results analysis but for future regression testing: - Simulators.
- Test drivers and stubs.
- Test suites.
- Test data.
3.4 Other Inputs To AnalysisIn addition to the information used to generate test results, the following may be important inputs to the analysis of the result: - Discrepancies found during testing (e.g., discrepancies between expected and actual results).
- Disposition of discrepancies.
- Retest history.
3.5 Analysis PracticesWhen performing the actual test results analysis/evaluation, consider the following practices : - Use application or domain specialists as part of the analysis/evaluation team.
- Use checklists to assist in the analysis and ensure consistency.
- Use automated tools to perform the analysis, when possible.
- Capture a complete account of the procedures that were followed.
- If a test cannot be evaluated, capture that fact and the reasons for it.
- Plan the criteria to be used to evaluate the test results, consider (from a 1997 University of Southern California Center for System and Software Engineering project file entitled, “Software Test Description and Results”):
- The range or accuracy over which output can vary and still be acceptable.
- The minimum number of combinations or alternatives of input and output conditions constitute an acceptable test result.
- Maximum/minimum allowable test duration, in terms of time or number of events.
- The maximum number of interrupts, halts or other system breaks that may occur.
- Allowable severity of processing errors.
- Conditions under which the result is inconclusive and retesting is to be performed.
- Conditions under which the outputs are to be interpreted as indicating irregularities in input test data, in the test database/data files, or test procedures.
- Allowable indications of the control, status, and results of the test and the readiness for the next test case (maybe the output of auxiliary test software).
- Additional criteria not mentioned above.
- Any information about the setup of the test (including versions of tools, hardware, simulations, …) to make the test repeatable and provide context on assumptions and limitations.
When recording the outcome of the analysis, important items to include are: - Major anomalies.
- Problem reports were generated as a result of the test.
- Operational difficulties (e.g, constraints or restrictions imposed by the test, aspects of the requirement under test that could not be fully verified due to test design or testbed limitations).
- Abnormal terminations.
- Reasons/justifications for discrepancies (e.g., caused by test cases or procedures, not a product issue).
- Any known requirement deficiencies present in the software element tested.
- Corrective actions were taken during testing.
- Success/failure status of the test.
Additional guidance related to software test results may be found in the following related requirements in this Handbook: NPR 7150.2 - Section 4.5 SWEs including: 3.6 Additional GuidanceAdditional guidance related to this requirement may be found in the following materials in this Handbook: 3.7 Center Process Asset Libraries
See the following link(s) in SPAN for process assets from contributing Centers (NASA Only). |