The IAT was created on the Project-O program. It is a tool platform that comes with a documented engineering process, which describes when and how the IAT is used during the various phases of a program. Project-N is the first program to attempt to adopt the IAT, and Project-N is choosing to modify the process established on Project-O. The following tables summarize the changes being made to the IAT and its Project-O processes. The Project-O IAT engineering process is summarized at Process and Traceability and can be found in the project documentation, such as plans and related memos. See also the Tailoring FAQ and Metrics FAQ.
The IAT process and metrics across multiple projects are summarized in the following tables: Project Metrics, IAT Application, Automated Code Analysis Detectors, Reports, IAT Web Enabled Services, and Possible Future Enhancements.
The IAT produces metrics reports automatically and transparently as its services are provided to support peer reviews, code walk throughs, and instrumentation. The reports include automatic comparisons to a previous baseline and to a selected "gold standard" baseline, which allows an analyst to quickly determine if things are getting better, worse, or staying the same. A great deal of data is collected, categorized, and visually organized so that a user can quickly spot potential issues. There is also a big-picture summary at the end of the detailed metrics areas which offers an "IAT judgment" on the collection of code. The IAT judgment is marked as WORSE, BETTER, or SAME.
zz-iatdemo-B1.0, ft-app-B1.0, ft-app-B1.1
Analysis Group Stats: This area gathers traditional metrics such as LOC and number of req's, and it provides different views of those fundamental metrics, such as the ratio of comments to LOC.
Possible Problems: This area is a remnant from the stats-problems report, but it provides a description of each metric item in the graphical sections of the report.
Metrics - Percents Within Collections: This area provides metrics in terms of percent of the code analyzed. This is a great way to compare peer reviews (small collections of code) against a previous baseline release (the whole thing). It is also a great way to compare different projects which follow similar, but not exact, coding standards. For example, assume version A has 10 items flagged and is 10,000 LOC, while version B has 20 items flagged but is 50,000 LOC. Even though version B has more items flagged, one could argue that it is better than version A, since as a percentage the flagged items are lower (20 / 50,000 = 0.04% versus 10 / 10,000 = 0.10%).
Metrics - Counts Within Collections: This area provides metrics in terms of absolute counts. It is used to compare version releases on the same project and to get an assessment of the exact number of problems.
In all cases the bar charts represent a deviation from the norm; the norm is the green area. In general, for the detailed problems report area, anything to the left on the bar chart is better and anything to the right is worse. A pegged needle is red. The bar charts at the top of the report are not in terms of good or bad, just different. The up/down triangles are another indication of how the software is progressing: green tends to indicate progress and red tends to indicate entropy, hence the name of the report, entropy.html :)
The following is a brief description of the Analysis Group Stats metric items.
The following is a brief description of the Analysis Summary metric items. These items are reported in terms of flagged LOC, files, and/or functions.
The following is a brief description of the Analysis Summary metric items at the end of the metric details. They represent a total of the metrics and include the following items.
No. | Application & Use | Project-O | Project-N | Project-P | Comments |
1 | Analysis of code with log events in the R boundary | yes | yes | yes | |
2 | Analysis of code without log events in the R boundary | yes | no | no | |
3 | Analysis of code outside the R boundary | no | no | yes | |
4 | C Instrumentation in the R boundary | yes | yes | yes | |
5 | Assembly Instrumentation in the R boundary | no | yes | TBD | |
6 | Instrumentation outside the R boundary | no | no | yes | The software team used a trace buffer with printf's on the ARM processor to support integration and debug. |
7 | Comment statements include descriptive text | yes | no | yes | |
8 | TOC / BFW reqs instrumented | yes | yes | yes | |
9 | Other reqs instrumented | partially | partially | yes | Project-O had some SRS req's. Project-N will have some product spec req's. |
10 | Used to support peer reviews | yes | TBD | TBD | |
11 | Used to support code walk throughs | yes | TBD | TBD | Project-O code walk throughs were eventually merged with the preparation for the formal test. |
12 | Used to support formal testing | yes | yes | yes | |
13 | Used to support next release and regression testing | yes | yes | yes | There was significant Project-O next release testing. |
14 | Used to support customer questions | yes | yes | yes | Web enabled search services |
15 | Used to support Req's extraction for new releases | yes | yes | yes | Web enabled filter services |
No. | Code Analysis | Established | Project-O | Project-N | Project-P | Comments |
1 | C Headers | 2001 | yes | yes | This verifies the prologue header | |
2 | H Headers | 2001 | yes | yes | This verifies the prologue header | |
3 | ASM Headers | 2001 | yes | yes | This verifies the prologue header | |
4 | Classification | 2001 | yes | yes | ||
5 | Fatal Errors | 2001 | yes | yes | printf was removed for Project-N | |
6 | SV Marking | 2001 | no | no | This detector was turned off in Project-O 2.0 | |
7 | CV Marking | 2001 | no | no | This detector was turned off in Project-O 2.0 | |
8 | Fixed Keywords | 2001 | PR-yes FT-no | yes | The Project-O fixed keywords are: C - tbd, tbs, #if, #else, #error, #endif, #if defined, ifdef, while(.), malloc; H - tbd, tbs, #elif, #else, #error, #if defined, ifdef, while(.). The Project-N fixed keywords are: . For Project-O 2.0 the detector is turned off for FT (final test - release); the fixed keywords are: . | |
9 | Missing Curly Braces | 1Q 2002 | yes | no | Project-O did not originally include this feature. This was created for NE100. Project-N currently does not use the feature. | |
10 | Switch Default Balance | 2001 | yes | yes | ||
11 | Default Error Balance | 2001 | yes | yes | Project-N does not appear to have a common error calling approach | |
12 | Case Break Balance | 2001 | yes | no | ||
13 | Nested Switches | 2001 | no | no | Project-O used this feature up until rel 1.5 | |
14 | Stacked Cases | 2001 | no | no | Project-O used this feature up until rel 1.5 | |
15 | Calling Rules | 2001 | yes | no | Project-O uses a CPC naming convention to support the calling rules. | |
16 | No Error Exits | 2001 | no | no | Project-O used this feature up until rel 1.5. The detector actually surfaces a different problem: most files that have no error exits also have less than 5 LOC in the file, usually just 1 LOC. | |
17 | Greater Than 15 Functions per module (file) | 2001 | yes | desired | Project-N has missing function headers, making it difficult to find functions. For Project-O 2.0 the limit was increased from 6 to 15 functions. | |
18 | Greater than 100 LOC per function | 2001 | yes | desired | Project-N has missing function headers making it difficult to find functions | |
19 | Less than 5 LOC per function | 2001 | no | desired | Project-N has missing function headers, making it difficult to find functions. For Project-O 2.0 the detector was turned off. | |
20 | Uncalled Functions | 1Q 2002 | PR-no FT-yes | desired | Project-N has missing function headers, making it difficult to find functions. For Project-O 2.0 the detector was turned off for PR (peer review). | |
21 | Dead Code (Commented Out Code) | 2001 | no | yes | For Project-O 2.0 the detector was turned off | |
22 | Fuzzy Log Events | 2001 | yes | yes | The detector has been turned down to concentrate on the serious problems. Future IAT may include a switch to enable these other items as a separate control option. | |
23 | Line Length (selectable) | 2Q 2002 | PR-yes FT-no | no | This was originally desired on Project-N and is also part of the Project-P SSPM. For Project-O 2.0 the detector was turned on for PR. | |
24 | do Loops | 2001 | no | no | Disabled early on Project-O program | |
25 | goto | 2001 | yes | no | Disabled early on the Project-O program. For Project-O 2.0 the detector was turned on. | |
26 | ?: operator | 2001 | PR-yes FT-no | no | Disabled early on the Project-O program. For Project-O 2.0 the detector was turned on for PR. | |
27 | ++/-- within if, else, while | 2Q 2002 | no | no | ||
28 | 500 LOC per File | 2001 | yes | no | Disabled early on the Project-O program. For Project-O 2.0 the detector was turned on. | |
A | Tagging of reused code in all results | 2001 | yes | no | This requires a file naming convention. Project-O has one that allows for the quick ID of re-used code. | |
B | Tagging of new, modified, or untouched code | 1Q 2002 | yes | TBD | This comes into play as new functionality is rolled out with new releases. | |
C | Keywords in the keywords-all.html report | 2001 | yes | TBD | All automated detectors started with a keyword search in the keywords-all.html report. The Project-O keywords are: . The Project-N keywords are: . | |
D | Quality Metrics | 2Q 2002 | TBD | TBD | For each detector there is a count of bad vs good and a percentage. It also includes a total at the end. |
PR - Peer Review
FT - Final Test / Release
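As a quick illustration of what a few of the detectors listed above look for, the following C fragments were invented for this page. They are not taken from any project's code base, and the comments describe only a plausible reading of the detector names, not their exact rules.

    /* Invented C fragments showing constructs the detectors above would flag. */

    void do_work(void);
    void start_channel(void);
    void stop_channel(void);
    int  init_device(void);

    void detector_examples(int mode)
    {
        if (mode == 1)
            do_work();                    /* "Missing Curly Braces": body not wrapped in { } */

        switch (mode) {
        case 2:
            start_channel();              /* "Case Break Balance": falls through, no break */
        case 3:
            stop_channel();
            break;
        }                                 /* "Switch Default Balance": no default case */

        mode = (mode > 3) ? 0 : mode;     /* "?: operator" detector */

    retry:
        if (init_device() != 0)
            goto retry;                   /* "goto" detector */

        while (1) { }                     /* matches the while(.) pattern under "Fixed Keywords" */
    }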
No. | Report | Established | Project-O | Project-N | Project-P | Comment |
1 | codegen | 2001 | yes | yes | work horse report | |
2 | le-summary | 2001 | yes | yes | third priority report | |
3 | stats-problems | 2001 | yes | yes | work horse report | |
4 | simulate | 2001 | yes | yes | third priority report | |
5 | data reduction | 2001 | yes | yes | work horse report | |
6 | error handling | 2001 | yes | yes | second priority report | |
7 | keywords-all | 2001 | yes | yes | second priority report | |
8 | hamming distances | 2001 | yes | yes | third priority report | |
10 | abstracts | 2001 | yes | yes | not really used | |
11 | comments | 2001 | yes | yes | not really used | |
12 | cpc-files | 2001 | yes | yes | not really used | |
13 | watchdog timer | 2001 | yes | yes | not really used | |
14 | cpc-listing-c | 2001 | yes | yes | used by the IAT to provide hyper links to the lines of code being examined | |
15 | cpc-listing-h | 2001 | yes | yes | used by the IAT to provide hyper links to the lines of code being examined | |
16 | cpc-listing-asm | 2001 | yes | yes | used by the IAT to provide hyper links to the lines of code being examined | |
17 | sc-var-extern | 1Q 2002 | no | no | IAT experiment to address finding sc variables | |
18 | sc-var-incodeblock | 1Q 2002 | no | no | IAT experiment to address finding sc variables | |
19 | sc-var-possible | 1Q 2002 | no | no | IAT experiment to address finding sc variables | |
20 | sc-var-public | 1Q 2002 | no | no | IAT experiment to address finding sc variables |
Service | Established | Project-O | Project-N | Project-P | Comments |
Enterprize Portal | 2001 | yes | yes | yes | Used for training and bringing people up to speed with limited project access. |
Project Portal | 2001 | yes | yes | yes | Used to navigate the project IAT products. |
Search | 1Q 2002 | yes | yes | Used by systems and test to search the current software "baseline". | |
Filter | 1Q 2002 | yes | yes | Used by systems, test and software to extract reqs from a specification. A unique subtract feature is used during the test phase to find req coverage differences between test runs. | |
Compare | 1Q 2002 | yes | yes | Used to compare test runs. | |
Enterprize Mgmt | 1Q 2003 | partial | partial | If not enforced, IAT points of contact do not use the service. Part of the problem is they have no direct benefit and the system is not on the project portal. | |
Program Mgmt | 4Q 2003 | TBD | TBD | Attempt to address the operational limitations of Enterprize Management. Use of this service will help the IAT points of contact to create their products by automatically creating their directories. |
Project-O | Project-N | Project-P |
* item has been incorporated into IAT
1. Why do I need to add descriptive text to my log events? Why can't the IAT attach the actual req text based on the PUI number?
Yes, the IAT can apply the original req text in its reports; in fact, one of the existing reports does exactly that. However, experience has shown that req's are satisfied in multiple locations, so a log event should provide some indication of the portion of the req satisfied by the preceding code. Further, when someone looks at the listings, the log events provide no information without the descriptive text. Some folks still rely on examining the listing to understand and solve their particular problems (e.g. BAH).
The testers rely heavily on the descriptive text. It is succinct and supports their needs during the test and during the initial phases of analysis without drowning them in a sea of words and meaningless statements. In fact, the original approach was to provide the full req text to the testers when the original data reduction report was created; it was actually easier to implement in PERL. It was a bad solution. Although the data reduction report still contains that switch option, it is never used.
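As a purely hypothetical illustration (the comment syntax, the PUI number 1047, and the helper names below are invented; the actual Project-O log event convention is not reproduced in this document), the difference the descriptive text makes when reading a listing looks something like this:

    /* A bare log event comment, such as
           enter_degraded_mode();
           // LOG EVENT 1047
       gives a reader of the listing nothing to go on. The same event with
       descriptive text tells the tester and the reviewer which portion of the
       req the preceding code satisfied: */

    extern int  toc_timer_expired(void);      /* invented helpers */
    extern void enter_degraded_mode(void);

    void toc_monitor(void)
    {
        if (toc_timer_expired()) {
            enter_degraded_mode();
            /* LOG EVENT 1047 TOC timeout detected, entering degraded mode
               (covers only the timeout branch of the req) */
        }
    }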
Finally, some folks have been wondering what it is specifically that the IAT does to help increase the quality of the product. Some folks think that the process of thinking about a req and what it means, so that it can be placed properly by the developer, provides insights into the design that may have been previously missed. A light bulb goes off in the developers' heads as they finally reach a resolution to each of the 1% of the log events that are difficult to place and "paraphrase".
2. What does IAT do and does the IAT do anything positive for a program?
As far as what IAT does, it:
The Project-O customer has stated that they think IAT was one of the factors in why they received a superior product. Some folks think that the process of thinking about a req and what it means so that it can be placed properly by the developer provides insights into the design that may have been previously missed. This results in a better product as the realization sets in that a log event does not fit someplace.
Other folks think that it is a forcing function. In the process of running IAT certain things that may be ignored in a program till it is too late, surface as part of the process of running IAT.
Finally, some folks think the IAT puts fear into the developers' hearts and they are coerced into jumping through the hoops.
3. I don't have time to place the log events into the code and it is a huge burden and overhead to the software staff?
A funny thing happened on the way to this process: a 1980's thing came back, but not exactly, and this time it actually makes sense. Work previously done by a chief programmer type and a test engineer has been pushed to the developers - but not really.
In the past the chief programmer would annotate the code in anticipation of a formal code walk through with the government. This would include going back to the source (original developer) to determine or validate where a requirement was satisfied. This function is now performed once and it is done by the most knowledgeable member of the program, the original developer.
Also, as part of the old test process, emulator break points would need to be identified to support SV testing. In the hunt for the elusive breakpoint that would work given all the peculiar characteristics of an emulator, the original software developer again would be called in to identify many if not all of the points and emulator settings. Introducing log events reduces emulator breakpoints in the final test phase, freeing the software developers to work on future functionality or correct deficiencies.
Upon further reflection, the work was not really pushed to the software developers. In the past the work was just repeated - do it in the beginning, in the middle, and in the end. The work would be captured in various disjoint information products such as SRDBs (DOORS), SDFs, memos, etc. In this case, with the IAT, the information is not duplicated (or lost) but captured in the final product, the source code.
This process adds some rigor to the original work performed by the software staff so that it is effectively captured and not revisited during different phases of the program. The rigor is the process of placing the log event in the appropriate location in the code.
If you still think this is a huge burden, some stats from Project-O provide further insight into the overhead issue. There are approximately 128,000 lines of stuff, of which 28,000 are executable code, supporting 1,052 log events. That is 0.8% or 3.8% depending on how you look at it (1,052 / 128,000 ≈ 0.8%; 1,052 / 28,000 ≈ 3.8%). For me the answer is 0.8%, because someone needs to enter all that non-executable stuff into the code as well.
4. I only want a specific set of things from the IAT during code analysis so why are we talking about other things associated with the code?
The IAT is not a replacement for the human. In the end it is the analysts who determine whether there is a problem with the software that should be addressed. The IAT is a tool to help crunch through the stack of paper. It does this by providing different isolated views so that the analyst can focus on specific areas. For example, all decisions (if, else, switch, case, etc.) are pulled out and color coded in a report so that an analyst can determine if there are special needs unique to this domain. So, if in the process the analyst sees that there are 4 statements on the same line (and the coding standards are silent), the analyst should not remain mum. The analyst needs to raise this issue at the review. Anything else would subvert the purpose and spirit of the IAT and its process.
Further, the detectors and reports are in a constant state of evolution. This evolution is based on the fact that the analysts constantly need to respond to new questions as they arise on a program and the IAT is a tool to help them answer those questions. For example, if it is determined that a type recast has caused a problem, the analyst has 3 choices: (1) do nothing, (2) print out the hardcopy and look for type recasts in all the source code using a yellow highlighter, or (3) spend 15 minutes and add a new IAT service to look for type recasts. Many folks do not understand this simple concept and they resist adding to the IAT because it is viewed as adding more work to the software effort, which by the way has nothing to do with getting basic functionality out the door. However, from a program perspective there is only one right answer, anything else would leave the program open to risk. One could even argue that the DR should not be closed until all type recasts are checked.
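As an invented example of the kind of line such a 15-minute detector would surface for the analyst to judge (nothing here comes from a real project):

    /* Invented fragment: an explicit narrowing cast that silently truncates a
       32-bit value to 16 bits - the sort of line a "type recast" detector
       would list so the analyst can decide whether it matters. */

    unsigned short pack_status(unsigned long raw_status)
    {
        return (unsigned short)raw_status;    /* flagged: narrowing type recast */
    }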
5. Why do I need PUIs in the DOORS database?
Project Unique Identifiers exist so that keyword searches return expected results. The IAT reads in the DOORS database. If PUIs are not established, there is a risk of req text being confused with attributes as the IAT performs its various automated correlations. In the end you won't be able to track the progress of your req's being implemented with each review or release.
6. Why do I need to run the IAT to support my internal peer reviews?
Because you won't have time to fix all the things that will be surfaced later when the IAT is run to support either a formal code walk through or a test event.
This is independent of IAT. It is a fundamental principle of software engineering to catch things as early as possible and avoid them as future development continues. So why would anyone want to remove critical information at that stage, whether it comes from a person, the IAT, or a person who uses and understands the IAT?
7. What is CM and why do I need to understand it?
The IAT's fundamental inputs are software source code and req's from the SRDB (System Reqs Data Base). If you don't know where to find the software to support each activity, you can't provide the service. The activities are peer reviews, code walk throughs, and test / releases.
8. Where did the IAT come from?
The original IAT was focused only on instrumentation. A prototype was developed in a few days which instrumented some code, simulated the results, and performed data reduction.
Since the IAT examines each line of code, it became apparent that it might be useful in helping to support reviews. The idea was to use the Internet search approach, but rather than return web pages, return lines of stuff from one or more files. The original code analysis extracted source code instructions and removed the comment and header information; the goal was to shrink the size of the paper stack. This then evolved to create reports where only comments or abstracts were provided, in addition to the source-code-instructions report. These reports are still present, but rarely used. The code reports eventually evolved into the details area of the new reports. Eventually a report was created based on searches for multiple keywords. The keyword search eventually evolved into automatic detectors that are now part of the stats-problems report and three new reports: hamming distances, error handling, and watchdog timer. Early in the activity the reports were stand alone with no hyper links. Eventually the reports evolved to include hyper links across the reports, source code, and specifications.
The IAT was eventually fitted with services to support the back end test portion of a program. The original data reduction capability was present, but rather than processing the simulated output of instrumented code, it was modified to process live log events from a target test configuration. A regression analysis capability was added which performs a bi-directional DIFF and lists new, modified, and deleted files. Eventually web enabled services were added to search through the software repository, filter the req's, and compare test data from different test runs.
The final stage of the IAT was the creation of a web portal with links to its services and to the IAT project areas. The portal exists for each project and for this generic space. So the IAT has evolved into a tool platform that provides services that tie together many things on a program which previously were not possible. Further, it performs many of the manual tasks that no one really wanted to do in the past, like poring through stacks of source code listings or finding workable emulator break points.
9. Where is IAT Going?
There were 3 alternatives presented:
The current approach is (3): keep it in house and perhaps convert it to a best practices entity. Note: the government has formally stated that they would like to place it on their internal shelf of tools at their site.
10. How do I know I should use the IAT on my program?
We could talk about mission critical, safety, and security but it is all irrelevant.
If you have taken the time to conclude that you need an SRDB and you have taken further time to actually populate that SRDB using a tool like DOORS then you should use IAT. Why? Because all the time you spent on the SRDB is a waste unless that SRDB is connected to the software. The IAT is the only thing that will connect all that work to your software in a precise unambiguous manner. Further, the connection is performed once as part of a process and activity that already exists when the software folks find each of their collection of req's for each of their pieces prior to implementation. There is no duplication of activity where systems and test revisit these allocations. In the past the connection, and I use that word loosely, was performed by testers as they allocated requirements to their test procedures. The link to the actual software was approximate via the test cloud and imprecise via word processing products.
If you do not plan to use the IAT then don't use DOORS; use word processing tables and Excel spreadsheets. The rigor available via DOORS is lost once views of the software as implemented are pulled. The original CPC allocations in DOORS do not match the final as-implemented code.
11. How was the IAT validated and can I trust its outputs?
IAT evolved on the Project-O program over an 18 month period. It has approximately 500 hours of execution time that was used to support approximately 75 peer reviews and approximately 15 internal / external releases.
It started as just simple Internet-style search queries based on regular expressions and grew to its current state. The search queries were validated by the analysts as the findings were understood, by clicking through to the actual source code or req's. The search queries stayed in place and were used to verify the more elaborate mechanisms such as the automated error detectors in the stats-problems report. So there is built-in redundancy in the views of the data, where in the end the analyst can always go back to the source code or req text. That is how each detector and keyword search regular expression was matured.
The instrumentation was built with the same redundancy concepts. When instrumentation is performed, a summary report is provided. Links to the instrumented and non-instrumented code are easily accessible and bound the instrumentation results. Finally, there is an automatic DIFF, which cannot be bypassed, whose output uses color and layout so the analyst can quickly ascertain whether the instrumentation performed as expected.
So the elements to the IAT that instill trust and validation of its products are:
Can you trust the IAT? Only if you use its services. That means using the hyper links and thinking about the findings.
12. This is really hard and my head is going to explode so why can't software just focus on the functionality of the box?
If we just focus on the functionality of the box there is no need for our services. Anyone can then do this and offer the product for $19.95. This may sound like a funny statement, but there is a significant element of truth. A device is not just about functionality. It is also about performance. We happen to work in an area which requires a certain level of performance that few organizations are able to provide at this time. The IAT attempts to add automation to a process that is years in the making. IAT and its processes may not be perfect but do you have a better alternative? If so, go for it, that's fundamentally what we all do anyway, but watch out for exploding heads :)
13. Ok I am sold, but if I do this how does the data come out of the device during test?
The IAT converts the comment statements to a function call. This unique function is responsible for getting the data out to an external interface port. It does not matter what the data format is; however, ASCII is strongly suggested. It does not matter what fields are included in the output; however, an internal counter along with the log event is strongly suggested. Experience has shown that 4 decimal digits for the log event, prefaced by a decimal counter, are sufficient to do everything the IAT needs for its test data analysis. The counter is used to determine if data is internally lost. This is accomplished with an IAT counter check during data reduction.
This unique function call is very important because you want to make sure you keep the overhead to a minimum. Most operating systems and executives, like pSOS and TI's DSP BIOS, provide unique services to support this capability. You just need to search for keywords like "log", "log_event", and "event" in their documentation. It's usually less than 10 pages, but those 10 pages are extremely important in getting the data out of your box. Do not fall into the trap of using a printf or rolling your own routine until you have completely exhausted your search of the documentation on your system.
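As a rough sketch only (none of this comes from Project-O; the names iat_log_event and port_write_ascii are made up, and the real comment-to-call conversion is project specific), a comment such as /* LOG EVENT 1047 ... */ would become a call like iat_log_event(1047), with the called function producing the suggested "counter plus 4-digit event" ASCII record and handing it to whatever external interface port service the platform provides:

    /* Hypothetical sketch of the single output function the IAT-converted code
       would call. It avoids STDIO and hands the formatted ASCII record to an
       assumed platform port service. */

    extern void port_write_ascii(const char *s);   /* assumed BSP / OS service */

    static unsigned long iat_counter = 0;          /* used to detect internally lost data */

    static void put_decimal(unsigned long value, int digits, char *out)
    {
        int i;
        for (i = digits - 1; i >= 0; i--) {        /* fixed width, zero padded */
            out[i] = (char)('0' + (value % 10));
            value /= 10;
        }
    }

    void iat_log_event(unsigned int event_id)      /* event_id: 4 decimal digits */
    {
        char line[13];                             /* "CCCCCC EEEE\n" plus NUL */

        put_decimal(++iat_counter, 6, line);       /* counter for the data reduction check */
        line[6] = ' ';
        put_decimal(event_id % 10000u, 4, &line[7]);
        line[11] = '\n';
        line[12] = '\0';
        port_write_ascii(line);
    }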
14. Wooow, wait a minute, this is not my job and this is going to cost extra money and time, we are not doing this?
Here is an old architecture concept from a few decades ago. When you build something it needs to be PRODUCED, MAINTAINED, and yes, TESTED. Design for producibility, maintainability, and testability may not appear to fit with the current philosophy of let's get lean and mean, but if you want a successful project you had better expand your horizons, and you need to work together with all the stakeholders on a project.
So yes, there is work that needs to be done, and the work should not be short changed. Even though the logging function is very small, it can have a big impact on the device from Heisenberg's point of view. Yes, we are perturbing the device, and yes, Mr. Heisenberg did say something like you can never measure a thing because by the very nature of looking at it you have modified its true nature. However, engineering is about tradeoffs, not the ideal. It is about approaching reasonable ideals.
So, whoever is tasked with getting the test data out of the box needs to research the computer and support software. In the case of the TI DSP, which was the target platform for the initial IAT, the TI folks created a service in their "operating system". The TI BIOS has a service that allows test data to be sent out the JTAG port with minimal overhead. The service is coincidentally called "Log_Event". When a Log_Event call is made, only 40 machine cycles are executed, as opposed to a printf which would take 40,000+ machine cycles. Also, no additional memory is needed from a library perspective; in the case of printf, the STDIO library needs to be included, which adds to the memory.
So, yes someone needs to determine the most effective mechanism to get the data out of the box. The mechanism needs to be very simple and take advantage of any unique services available from the hardware and support software.
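For the TI case described above, routing the output through the BIOS log service rather than printf might look roughly like the sketch below. The header names, the Arg casts, and the LOG object declaration are written from memory and should be checked against the TI DSP/BIOS documentation for the BIOS version in use; the LOG object itself is normally created in the BIOS configuration, not in C.

    /* Sketch only: send the counter and event ID through the DSP/BIOS LOG
       module so the per-event overhead stays small and no STDIO library is
       pulled into the build. */

    #include <std.h>                 /* DSP/BIOS base types (Arg) */
    #include <log.h>                 /* DSP/BIOS LOG module */

    extern LOG_Obj trace;            /* LOG object defined in the BIOS configuration */

    void iat_log_event_out(unsigned long counter, unsigned int event_id)
    {
        LOG_event(&trace, (Arg)counter, (Arg)event_id, (Arg)0);
    }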
15. You know, I did all this and I am getting 140 log events per second coming out of my box?
The IAT will filter the noise log events during data reduction; however, you messed up really badly. Remember, you have to design for producibility, maintainability, and testability. Assuming you followed the log event placement rules and there are no log events in the real time path, there obviously are log events in some recurring path, like a periodic self test path. Shame on you. You should have thought about this earlier; it should have been addressed in the design and definitely closed out properly during code reviews.
But here you are, getting ready to test this beast, and it's bad. So what are the possible solutions?
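One generic mitigation, offered purely as an illustration rather than as the IAT's prescribed answer (the function names and event IDs below are hypothetical), is to gate a recurring-path log event so it reports a state transition instead of firing on every pass:

    /* Illustration only: emit the periodic self-test log event on a result
       transition instead of on every pass through the recurring path. */

    extern void iat_log_event(unsigned int event_id);   /* hypothetical log call */
    extern int  run_self_test(void);                    /* hypothetical: 0 = pass */

    void periodic_self_test(void)                       /* hypothetical recurring path */
    {
        static int last_result = 0;
        int result = run_self_test();

        if (result != last_result) {
            if (result != 0) {
                /* LOG EVENT 2101 self test failed (hypothetical ID) */
                iat_log_event(2101u);
            } else {
                /* LOG EVENT 2102 self test recovered (hypothetical ID) */
                iat_log_event(2102u);
            }
            last_result = result;
        }
    }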
1. Why do we need metrics and how should we gather all this stuff?
2. Why not just use cost and schedule data - why gather all this other stuff?
3. The metrics the IAT gathers are trivial and useless for our real time systems so why do this to ourselves?
4. What data should we collect?
5. Ok, if I gather the metrics, who should look at these things?
1. Why do we need metrics and how should we gather all this stuff?
All organizations gather metrics, even if they are not written down. We gather metrics to try to figure out what is going on, whether there are problems, and to try to make things easier / better. Why do I start off this way? Because many software organizations view metrics as an unnecessary burden taking away time from the "real work". In that view, these "evil" metrics provide no insight into the real problems and can be misused within an organization, so the answer is to avoid metrics and just code the software. It's important to understand this attitude, because any metric data that is gathered needs to be "correct" and common across "different" activities so that meaningful comparisons can be performed. In the ideal situation the metrics would be gathered transparently, in the background, with little or no human intervention. The IAT does that for the collection of software and requirements that it is provided.
2. Why not just use cost and schedule data - why gather all this other stuff?
The bottom line is that cost and schedule are not good enough. More visibility is needed into an organization. Without this visibility, the cost and schedule, once blown, are just that: blown. If finer-grained data can be provided to show what may happen prior to losing the milestone, then corrective action can be taken sooner and less time and money might be wasted. A typical response is to develop plans with more milestones and shorter schedule cycles. This is called micromanagement and is usually a failure, since it's not possible to anticipate activities / tasks below a certain level. So the answer is not to develop "better micro" schedules! The answer is to gather metrics and look at the data.
3. The metrics the IAT gathers are trivial and useless for our real time systems so why do this to ourselves?
"The software folks are busy trying to get basic functionality into the box and chasing real time problems. We need better real time tools and this paper stuff will not help us."
These are the typical responses when the topic of software metrics surfaces. So what does gathering information on the number of missing curly braces have to do with the serious problems in the lab? Some folks think that if these basic kinds of software quality characteristics are missing, then it's an indication of a stressed organization - an organization that is fighting to get basic functionality into the box and get it to work. The converse can be true, where the team goes out of its way to make "pretty code" without any emphasis on having the box perform. However, if that approach is taken, testing and the metrics associated with testing should surface that "bad" approach. So it's a balance: don't go out of your way to stress that the code "be pretty" at the expense of working functionality. If this balance between the "software" and "test" metrics is maintained, then there is a reasonable set of software metrics. Remember that the test metrics counterbalance the software metrics.
4. What data should we collect?
Tricky question. The problem is to minimize the noise so that the real information can be detected in the data. However, data considered noise in one instance becomes valuable information in another. So visualization is key to being able to reduce the noise level. What should be gathered? Anything that is easy to count on a consistent basis. The IAT has an initial set of metric item counters. If someone can identify an item to count, and the IAT can be easily modified, it should be added to the IAT.
5. Ok, if I gather the metrics, who should look at these things?
You should, along with your lead and the TD. In fact, the leads and TDs should use them to help understand where there are potential issues. The metrics should also percolate up to the organizational level, which should use this information to gain better insight into how things are progressing, given that cost and schedule indicators are usually too late; they are just a confirmation of a problem. Many folks think that everyone should have access to this information and be excited about being able to interpret the data, and that a mature organization (level 3 to 5) has metrics charts in almost every office :)