Database Research & Development

  • Home
  • NoSQL
    • NoSQL
    • Cassandra
  • Databases
    • Database Theory
    • Database Designing
    • SQL Server Coding Standards
    • SQL Server
    • PostgreSQL
    • MySQL
    • Greenplum
    • Linux
  • Interviews
    • SQL Server Interviews
    • MySQL Interviews
    • SQL Puzzles
  • DBA Scripts
    • SQL Server DBA Scripts
    • PostgreSQL DBA Scripts
    • MySQL DBA Scripts
    • Greenplum DBA Scripts
  • Home
  • Blog Archives !
  • (: Laugh@dbrnd :)
  • Contact Me !
sqlserverinterviews
Home 2017 June Greenplum: How to find Skewness of table (Skew of data)?

Greenplum: How to find Skewness of table (Skew of data)?

This article is half-done without your Comment! *** Please share your thoughts via Comment ***

The Greenplum is a based on MPP (Massive Parallel Processing) architecture.
There are multiple segments running in nothing shared mode that means your data should equally distribute across all segments.

If table data is not equally distributed, we cannot achieve the good performance of parallel processing system.

The Skewness of the table means that table data is not equally distributed across the segments and workload is not divided properly between the segments.

You can find skewness of data by checking gp_segment_id for each record.

The record count of segments should be very near to each other like 90% to 95%, and if you find a big difference in a count or 0 counts for few segments that mean your data is not properly distributed.

1
2
3
SELECT gp_segment_id, count(*)
FROM table_name
GROUP BY gp_segment_id;

Other two gp_toolkit views to get the information of Skewness of data:

gp_toolkit.gp_skew_coefficients: This view shows data distribution skew by calculating the coefficient of variation (CV) for the data stored on each segment.
gp_toolkit.gp_skew_idle_fractions: This view shows data distribution skew by calculating the percentage of the system that is idle during a table scan, which is an indicator of processing data skew.

Jun 30, 2017Anvesh Patel
Greenplum: How to check the Database Schema Size?Greenplum: Script to find the usage of Work spill file per Running Query

Leave a Reply Cancel reply

CAPTCHA
Refresh

*

Anvesh Patel
Anvesh Patel

Database Engineer

June 30, 2017 Greenplum, Greenplum DBA ScriptAnvesh Patel, database, database research and development, dbrnd, Greenplum, Greenplum Administrator, Greenplum Error, Greenplum Monitoring, Greenplum Performance Tuning, Greenplum Programming, Greenplum Query, Greenplum Tips and Tricks, PostgreSQL 8.2, Skew of data), Skewness
About Me!

I'm Anvesh Patel, a Database Engineer certified by Oracle and IBM. I'm working as a Database Architect, Database Optimizer, Database Administrator, Database Developer. Providing the best articles and solutions for different problems in the best manner through my blogs is my passion. I have more than six years of experience with various RDBMS products like MSSQL Server, PostgreSQL, MySQL, Greenplum and currently learning and doing research on BIGData and NoSQL technology. -- Hyderabad, India.

About DBRND !

dbrnd

This is a personal blog (www.dbrnd.com).

Any views or opinions represented in this blog are personal and belong solely to the blog owner and do not represent those of people, institutions or organizations that the owner may or may not be associated with in professional or personal capacity, unless explicitly stated.

Feel free to challenge me, disagree with me, or tell me I’m completely nuts in the comments section of each blog entry, but I reserve the right to delete any comment for any reason whatsoever (abusive, profane, rude, or anonymous comments) - so keep it polite.

The content of this website is protected by copyright. No portion of this website may be copied or replicated in any form without the written consent of the website owner.

Recent Comments !
  • Anvesh Patel { Sure will do... } – May 27, 12:43 PM
  • Anvesh Patel { Great... } – May 27, 12:41 PM
  • Anvesh Patel { Great... } – May 27, 12:39 PM
  • Anvesh Patel { Great... } – May 27, 12:36 PM
  • Anvesh Patel { Great... } – May 27, 12:28 PM
  • Anvesh Patel { Great... } – May 27, 12:27 PM
  • Anvesh Patel { Great... } – May 27, 12:16 PM
  • Older »
Follow Me !
  • facebook
  • linkedin
  • twitter
  • youtube
  • google
  • flickr
© 2015 – 2019 All rights reserved. Database Research & Development (dbrnd.com)
Posting....