Home IT Linux Windows Database Network Programming Server Mobile  
           
  Home \ Server \ Hadoop new and old version of the difference in the size of the InputSplit     - MySQL use benchmarking tool sysbench (Database)

- Source compiler install Nginx (Server)

- How to Upgrade Ubuntu GNOME 14.10 to GNOME 3.16 Desktop (Linux)

- Debian SSD ext4 4K aligned (Linux)

- To create a secure network firewall with iptables Under Linux (Linux)

- Installing software on Ubuntu: apt-get and dpkg difference (Linux)

- How to make GRub instead of the default Ubuntu software center (Linux)

- Timeout control related to Python threads and a simple application (Programming)

- Merge sort Java implementation (Programming)

- Iptables on the request URL for IP access control (Linux)

- Java threads in the life cycle (Programming)

- Comparison of sorting algorithms (Programming)

- Cacti monitoring service Nginx (Linux)

- IOS interview questions Summary (Programming)

- C # function (Programming)

- Incremental garbage collection mechanism for Ruby 2.2 (Programming)

- Five strokes to find out the IP address you want to know (Linux)

- CentOS permanently banned from running in the background PackageKit (Linux)

- MySQL5.6.12 Waiting for commit lock lead to hang from the library housing problem analysis (Database)

- CentOS and RHEL to install IPython 0.11 (Linux)

 
         
  Hadoop new and old version of the difference in the size of the InputSplit
     
  Add Date : 2018-11-21      
         
       
         
  The number of InputSplits in a previous version of Hadoop is determined by the following three parameters:

GoalSize: totalSize / numSpilt.totalSize for the file size, numSplit map task for the user to set the number, the default is 1.

MinSize: The minimum value of InputSplit, which is set to 1 by the configuration parameter mapred.min.split.size.

BlockSize: The size of the block in HDFS.

SplitSize = max (minSize, min (goalSize, blockSIze))

New:

MaxSize: determined by the configuration parameter mapred.max.split.size, has no longer consider the number of user-set map task.

MinSize: The minimum value of InputSplit, which is set to 1 by the configuration parameter mapred.min.split.size.

BlockSize: The size of the block in HDFS.

SplitSize = max (minSize, min (maxSize, blockSIze))
     
         
       
         
  More:      
 
- Linux system installation Gitlab (Server)
- MySQL group_con cat_max_Len (Database)
- Linux resource restriction level summary (Linux)
- How to enhance the security of Linux systems (Linux)
- Camera-based face recognition OpenCV crawl and storage format (Python) (Linux)
- mysqldump issue a note (Database)
- VMware virtual machine Ubuntu install arm-linux-gcc cross-compiler environment (Linux)
- Regular expressions in Perl (Programming)
- MySQL Parameter Tuning Best Practices (Database)
- Upgrading KDE Plasma 5.3 in Ubuntu 15.04 (Linux)
- How to upgrade to Ubuntu 14.04 Ubuntu 14.10 (Linux)
- Linux NIC driver and version information (Linux)
- CentOS 6.5 can not connect to the network under VMware (Linux)
- TCP network communication Java Programming (Programming)
- Repair CentOS 6.4 Grub boot (Linux)
- Improve the Ubuntu SSH login authentication approach speed (Linux)
- SSH without password Definitive Guide (Linux)
- Protect against network attacks using Linux system firewall (Linux)
- Cool Android realization SVG animation (Programming)
- C ++ free store and heap (Programming)
     
           
     
  CopyRight 2002-2016 newfreesoft.com, All Rights Reserved.