data preprocessing
SearchSQLServer.com Definitions (Powered by WhatIs.com)


DEFINITION - Data preprocessing describes any type of processing performed on raw data to prepare it for another processing procedure. Commonly used as a preliminary data mining practice, data preprocessing transforms the data into a format that can be processed more easily and effectively for the user's purpose, for example as input to a neural network. A number of different tools and methods are used for preprocessing, including: sampling, which selects a representative subset from a large population of data; transformation, which converts or combines raw values into a single input suitable for the next stage; denoising, which removes noise from data; normalization, which rescales values to a common range so that they can be compared and processed consistently; and feature extraction, which pulls out specified data that is significant in some particular context.
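As a rough illustration of several of the methods listed above, the following minimal Python sketch applies sampling, denoising, normalization and feature extraction to a small list of numbers. The function names, the moving-median filter, the min-max rescaling (normalization is taken here in the statistical sense of rescaling values to a common range) and the sample data are all assumptions chosen for the example; they are one possible reading of each term, not a prescribed implementation.

```python
import random
import statistics


def sample(records, k, seed=0):
    """Sampling: draw k records at random, without replacement."""
    rng = random.Random(seed)  # fixed seed so the example is repeatable
    return rng.sample(records, k)


def denoise(values, window=3):
    """Denoising: replace each value with the median of its neighbourhood."""
    half = window // 2
    smoothed = []
    for i in range(len(values)):
        lo = max(0, i - half)
        hi = min(len(values), i + half + 1)
        smoothed.append(statistics.median(values[lo:hi]))
    return smoothed


def normalize(values):
    """Normalization: min-max rescale values into the range [0, 1]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]  # constant input carries no spread
    return [(v - lo) / (hi - lo) for v in values]


def extract_features(values):
    """Feature extraction: summarise a series with a few statistics."""
    return {
        "mean": statistics.mean(values),
        "min": min(values),
        "max": max(values),
    }


# Hypothetical raw readings; 95.0 plays the role of a noise spike.
raw = [10.0, 12.0, 11.0, 95.0, 13.0, 12.0, 14.0, 11.0]

subset = sample(raw, 4)
smoothed = denoise(raw)
scaled = normalize(smoothed)
features = extract_features(scaled)
```

In this sketch the median filter removes the spike before rescaling, so the outlier does not compress the remaining values toward zero. The order of the steps is a design choice for the example, not a rule.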

In a customer relationship management (CRM) context, data preprocessing is a component of Web mining. Web usage logs may be preprocessed to extract meaningful sets of data called user transactions, which consist of groups of URL references. User sessions may be tracked to identify the user, the Web sites requested and their order, and the length of time spent on each one. Once these have been pulled out of the raw data, they yield more useful information that can be put to the user's purposes, such as consumer research, marketing, or personalization.
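The Web usage preprocessing described above can be sketched in Python as follows. This is a minimal illustration under stated assumptions: the log is taken to be a list of (user, timestamp, URL) tuples, a 30-minute gap between requests is assumed to end a session, and time spent on a page is estimated as the gap to the next request in the same session. The names `sessionize` and `time_on_page`, the timeout value and the sample log are hypothetical.

```python
from collections import defaultdict
from datetime import datetime, timedelta

# Assumed cutoff: a longer gap between two requests starts a new session.
SESSION_TIMEOUT = timedelta(minutes=30)


def sessionize(log_entries, timeout=SESSION_TIMEOUT):
    """Group (user, timestamp, url) entries into ordered per-user sessions."""
    by_user = defaultdict(list)
    for user, ts, url in log_entries:
        by_user[user].append((ts, url))

    sessions = []
    for user, hits in by_user.items():
        hits.sort()  # chronological order within each user
        current = [hits[0]]
        for prev, hit in zip(hits, hits[1:]):
            if hit[0] - prev[0] > timeout:
                sessions.append((user, current))
                current = []
            current.append(hit)
        sessions.append((user, current))
    return sessions


def time_on_page(session_hits):
    """Estimate seconds per URL; the last page's duration is unknown (None)."""
    durations = []
    for (ts, url), (next_ts, _) in zip(session_hits, session_hits[1:]):
        durations.append((url, (next_ts - ts).total_seconds()))
    durations.append((session_hits[-1][1], None))
    return durations


# Hypothetical usage log.
t0 = datetime(2005, 4, 1, 9, 0, 0)
log = [
    ("alice", t0, "/home"),
    ("alice", t0 + timedelta(minutes=2), "/products"),
    ("bob", t0 + timedelta(minutes=1), "/home"),
    ("alice", t0 + timedelta(hours=2), "/home"),
    ("alice", t0 + timedelta(hours=2, minutes=5), "/contact"),
]

sessions = sessionize(log)
first_session_urls = [url for _, url in sessions[0][1]]
first_session_times = time_on_page(sessions[0][1])
```

Each resulting session is a group of URL references in request order, which corresponds to the user transactions mentioned above. Real logs usually also need steps this sketch omits, such as filtering out robot traffic and identifying users who share an IP address.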

LAST UPDATED: 01 Apr 2005

Read more about data preprocessing:
- A paper at DePaul University discusses data preprocessing in "Automatic Personalization Based on Web Usage Mining."








