Chapter 5: Understanding Data || Informatics Practices (IP) || Class 11th || NCERT CBSE || NOTES IN ENGLISH || 2024-25

   


Chapter 5: Understanding Data 

5.1 Introduction to Data

  • Definition of Data: Data refers to raw facts, numbers, or characters that represent information. People rely on data to make decisions, like selecting a college based on placement records or analyzing previous game scores to strategize.

  • Examples of Data Use: Governments collect census data to make policies; banks store account details to manage transactions; companies analyze customer preferences; and weather agencies monitor data to predict weather patterns.


5.1.1 Importance of Data

  • Decision-Making: Data helps in making informed decisions by revealing patterns, trends, and insights that may not be obvious initially.

  • Examples in Different Fields:

    • Banks update account balances based on transaction data.

    • Meteorological departments track satellite data for cyclone warnings.

    • Businesses use sales data to adjust pricing or discounts based on demand.


5.1.2 Types of Data

Data comes in various formats and is generally categorized into two main types:

(A) Structured Data

  • Definition: Organized data stored in a fixed format, like tables with rows and columns.

  • Examples: Customer databases, spreadsheets of school records, and inventory lists.

  • Usage: Structured data is easily managed and analyzed with tools like spreadsheets or databases.

(B) Unstructured Data

  • Definition: Data without a predefined format, often inconsistent, like text from emails or social media posts.

  • Examples: Images, videos, social media posts, and news articles.

  • Metadata: Information about unstructured data, like file size or type, helps in organizing it.


5.2 Data Collection

  • Definition: Gathering data from various sources for analysis and processing.

  • Examples of Collection Sources:

    • Sales data in stores, kept in registers or digital formats.

    • Social media posts to gauge public opinion.

    • Economic data collected by organizations like the World Bank.

  • Purpose: Collected data can provide insights, like a grocery store finding that certain products are frequently bought together, guiding their product placement strategy.


5.3 Data Storage

  • Definition: Storing data on physical or digital devices to ensure it is accessible for future use.

  • Storage Devices: Common digital storage devices include hard drives, SSDs, CDs/DVDs, memory cards, and cloud storage.

  • Examples:

    • Images, documents, and videos stored as files on a computer.

    • School databases for student records.

  • Limitations of File Storage: Managing large volumes of data through files alone is inefficient, leading to the need for Database Management Systems (DBMS).


5.4 Data Processing

  • Purpose: Raw data needs to be processed to extract meaningful information that aids in decision-making.

  • Steps in Data Processing:

    • Data Collection: Gathering the data.

    • Data Preparation and Entry: Organizing and inputting data.

    • Processing: Analyzing data through calculations or categorization.

    • Output: Results presented in the form of reports, tables, or charts.

  • Examples:

    • Banks processing ATM transactions by verifying balance and printing a receipt.

    • Examination boards processing student data to generate admit cards.


5.5 Statistical Techniques for Data Processing

Statistical techniques are essential for summarizing data and understanding its characteristics. Common techniques include:

5.5.1 Measures of Central Tendency

  • Mean: The average of a data set, calculated by adding all values and dividing by the total number. Used to find general trends but sensitive to extreme values.

  • Median: The middle value in a sorted list, representing the central point of the data. Median is unaffected by extreme values and is ideal for skewed data.

  • Mode: The most frequently occurring value in a dataset. Useful in identifying the most common element in non-numeric data.

5.5.2 Measures of Variability

  • Range: Difference between the highest and lowest values, showing the spread of the data.

  • Standard Deviation: Measures how much data varies from the mean, providing insight into data consistency. A low standard deviation means data is closely grouped around the mean, while a high value indicates more spread.


Summary

  • Data: Represents unorganized facts that can be processed into useful information.

  • Types of Data: Can be structured (easily organized) or unstructured (lacks a specific format).

  • Data Storage: Stored digitally on devices like hard drives, USBs, or cloud storage.

  • Data Processing Cycle: Involves inputting, storing, processing, and outputting data.

  • Statistical Techniques: Tools like mean, median, mode, range, and standard deviation help in data summarization and analysis.



0 comments: