HyperLink
HomeServicesSolutionsNews & EventsCompany Login
TestProNews Q3 - 2006
File vs Database Options for Test Data Management  

One of the major challenges that companies face when testing their products is managing the large quantities of test data generated by automated test systems. The role of a test data management (TDM) solution is to solve this problem. TDM systems capture, aggregate, and organize test data making it available for analysis and reporting. A good understanding of the architectures available in TDM systems is essential because it determines the security, expandability and performance of the TDM system, as well as the efficiency of your organization when using the system. The architecture of TDM systems can be grouped into two categories
(Figure 1) : 

    1. Test data is stored in individual files
    2. Test data is stored in a database 


Figure 1: The architecture of TDM systems can be grouped into two categories: individual file based architecture or database architecture

With a file-based TDM architecture, you store test data in standard flat file formats, use common-purpose search tools to find and index the data, and import the disparate test data into specialized analysis software (Figure 2).

The most common tools used today in this flat-file TDM architecture rely on the Microsoft Windows file system for storing data, Windows or Google desktop search to find data, and Microsoft Excel to analyze and report on the data.

The skills required to implement this type of TDM system are the basic computer software skills held by test engineers and technicians. The cost for this system is very low - in the $1k or less range.


 Figure 2: With a file-based TDM system, you store test data in standard flat file formats, use search tools to find the data, and import disparate test data into specialized analysis software.

The benefit of this type of system is that it is simple to implement and use. Test software can easily be configured or programmed to create flat files. You then use either the same or similar searching features for your test data as you would use to find other files on your PC. Data security is controlled in the same way you share files on your PC over the network relying on built-in Windows security.

The disadvantage of a file-based system is that its simplicity results in a lack of capabilities needed to deliver the efficiency of use, security, and performance of a large or expandable TDM system. Because these systems are built on the Windows operating system (OS) instead of a secure database and appropriate IT infrastructure, none of the advantages of enterprise database technologies can be readily leveraged for test.

One of the biggest disadvantages that users of automated test systems experience are the inefficiencies associated with retrieving data and reports from the system. Because the data is not organized in a consistent manner or in a format that facilitates drilling through multiple files, finding, correlating, and comparing your data can be a tedious process even with the greatest search capabilities. This becomes even more of a problem as data is spread out over different test stations.

The file-based TDM system is also inefficient because of the time it takes to aggregate data from different files. For example, one of VI Technology’s customers in the semiconductor industry originally opted for a file-based system. However, once the test department started acquiring large amounts of test data, they realized that the time required for the computer to retrieve the data from each of the test stations, aggregate it, and create a report took over twenty four hours per report. In order to reduce this time and increase the department’s efficiency, VI Technology deployed a database TDM system that reduced the required time to create the same reports down to seconds.

Additional deficiencies of file-based data management systems include limitations in accessing test data and reports, which directly affect usability for the user. For example, accessing data stored off-site with a file based system is usually hindered by firewalls, disconnected systems, slow transfer speeds, unreliable networks, etc. Therefore, data access is frequently restricted to repositories inside the company intranet or possibly only on the test station itself. In addition, the Windows operating system and many common applications used for analysis and reporting do not elegantly handle concurrent data access on the same file. Therefore, you will find instances in which you are unable to access data, that data is lost, or a file is corrupted due to a user accessing a file at the same time that a test station is writing data to the file.

Data security is another significant disadvantage for the file-based system because the security features are limited to what the built-in Windows OS security system can provide. Windows security must be granted to an individual at the folder level. This means that once you give a user a certain level of access privileges to a folder, those privileges permeate and apply to all subfolders and data in that folder. You can potentially solve this problem by breaking up test files into multiple folders, but obtaining a useful level of granularity for even a modest number of files and users usually becomes very complex. This complexity increases as the number of folders grows.

For example, some folder-based data management layouts rely on first breaking up test results into folders by day. Then, within each folder you would create subfolders for each type of product tested. If you have 3 products and 5 users, you would find yourself wrestling with setting up security privileges approaching 500 entities in the first month alone. This security maintenance problem is more extreme when the data files are stored on multiple test stations because you now must go to each test station and set the different privilege levels for each user.

As an attempt in solving this problem, test engineers often recruit assistance from the IT department in order to set up folders on a server. However, the test engineer will often run into problems and delays as they now have to involve IT every time they want to make changes to privileges. This eliminates one of the key reasons the file-based system was chosen in the first place: to avoid IT involvement.

You may also find that creating dynamic reports based on data stored in folders is a seemingly viable option. However, problems with a Windows-based file system often occur because you may set the privileges correctly for a user on the report folder, but the user of the report may not have the correct privileges on all the folders holding the data the report needs to access. If you need to reconfigure the structure of your data because you add a different type of user, you now have to also re-organize the test data into different folders which breaks the links of the reports to the test data.

File-based systems also pose significant limitations in terms of establishing company-wide standards for data file formats. As such, the problem of data chaos continues to grow over time as each new test developer creates the test data files in his or her own unique format without any constraints of a vision for organized test data.

Because the file-based system relies on common search tools and specialized analysis tools, anyone needing to create a report on the test data must have some understanding of the data locations and how to use the analysis tool. This requirement prevents engineering managers from easily running needed reports without learning the lower level engineering file details and tools.

Even with these limitations, the file-based TDM system does have its place for small test setups or bench-top systems. The simplicity of the file-based system makes it a viable system for a single test station, with a very small number of UUTs, running only a few tests per UUT. File-based systems are appropriate for labs and small design groups, but not for manufacturing operations, as these require a different database TDM system.

Database TDM Systems
The main architecture of a database TDM system supports the capture of test data into a unified, industry-standard database and utilizes ubiquitous, readily-available Web technologies through enterprise TDM software to access data for analysis and reporting.

The most common tools used today in this TDM architecture are a Microsoft SQL Server database, Arendar TDM software, and Microsoft Internet Explorer.

The skills required to deploy this type of database TDM system are basic database installation skills. The only skill required to use this system is the ability to use a standard Web browser. The typical entry cost of this kind of system is now under $20k.

A key benefit to the database TDM system is that it leverages all the benefits of commercial, enterprise-grade databases and technologies into test. This system overcomes all the limitations previously described that are specific to a file-based system. The efficiency of using such a system increases dramatically because all data is organized in an industry-standard relational database. Aggregating and correlating data to extract information from different tests is an implicit and automatic task with near-immediate response thanks to database technologies such as Online Analytical Processing (OLAP.)

Access to data and reports is available to anyone with the appropriate security and user privileges from any location as long as they have network access and a Web browser. A file-based system typically requires IT department involvement to open ports or set correct security permissions on folders residing in corporate servers. In comparison, a database TDM system does not require IT involvement in order to configure who can access test data and reports.


Figure 3: The most common tools used today for an enterprise TDM system include Microsoft SQL Server, Arendar TDM software, and Microsoft Internet Explorer.

Database TDM technologies are designed to handle concurrent access and complex security requirements. As such, the problems of data loss, corruption, and access privileges are eliminated.

The learning curve required for a user to access data and reports is much shorter with a database TDM system. The database TDM system provides an all-in-one environment integrating search and analysis capabilities through a familiar Web browser interface. However, users still have the option of using familiar scientific or specialized analysis software with the database TDM system enabling all required analysis capabilities beyond what the product offers out of the box.

The database TDM system establishes a framework for data organization, such that the data structure becomes standardized throughout the company. With a database TDM system, the problem of chaotic data is solved once in a simple yet elegant fashion.

The most common concern about database TDM systems is the upfront product and installation costs. Before the introduction of enterprise TDM software, such as Arendar, development time and costs were prohibitive for most companies. But with today’s latest enterprise TDM software, customers are regularly up and running within days and the costs, which include the industry/standard database, are well within most test department budgets.

The database TDM system is the most efficient and scalable solution for all but the smallest test applications. Once you have a large number of test results, multiple test stations, multiple users viewing and consuming test data, or multiple people viewing test reports, the database TDM system becomes the most cost effective and requires the least amount of time to maintain.

In terms of manufacturing operations, the database TDM system is the only viable system given the sheer quantities of data involved. For contract manufacturing operations, only the database system can provide the required security and data accessibility. The database TDM system is ideal for design and manufacturing, and as such your company can use the same TDM tool in design and manufacturing, even if you outsource your manufacturing operations to a contract manufacturer.

Best of Both Worlds
Both the file-based TDM system and the database TDM system suit specific applications and each have benefits. If you want a simple yet rudimentary system to find and analyze your data in the design lab, then the file-based system is appropriate. If you want a professional, scalable, and efficient system to securely manage your data with universal accessibility, a database TDM system is the most appropriate (Figure 4).


Figure 4: Comparison of File Based and Enterprise Database Options for Test Data Management

Ask the Expert
Email questions concerning this article to:
expert.testpronews@vi-tech.com

TestProNews
Read about the latest information for integrating automated test systems throughout the enterprise.
TestProNews  Download PDF









Have questions?
Contact us at
info@vi-tech.com
or call toll-free
1-888-689-4415
header
   Legal  Site Map  Contact Us © 2001-2008 VI Technology, Inc. All Rights Reserved.
footer