How to find the DataNodes that actually store a file in HDFS?

Posted on

A file may be splitted to many chunks and replications stored on many datanodes in HDFS. Now, the question is how to find the DataNodes that actually store a file in HDFS? You may use the dfsadmin -fsck tool from the Hadoop hdfs util. Here is an example: $ hadoop fsck /user/aaa/file.name -files -locations -blocks
Read more

How to write /etc/fstab entry for –bind mounting?

Posted on

How to write /etc/fstab entry for –bind mounting like mount –bind /home/hadoop/hdfs/store-tmp /home/store/tmp From man 8 mount: Since Linux 2.4.0 it is possible to remount part of the file hierarchy somewhere else. The call is mount –bind olddir newdir or shortoption mount -B olddir newdir or fstab entry is: /olddir /newdir none bind

How to increase the number of files allowed to be opened on Linux?

Posted on

On my system: $ ulimit -n 1024 Some tools like GATK are aggressive in creating temporary files by creating more than 1000 files under /tmp/. This will cause the program to fail. How to increase the number of files allowed to be opened on Linux? To increase the max number of open files to 10240,
Read more

How to change number of replications of certain files in HDFS?

Posted on

The HDFS has a configuration in hdfs-site.xml to set the global replication number of blocks with the “dfs.replication” property. However, there are some “hot” files that are access by many nodes. How to increase the number of blocks for these certain files in HDFS? You can the replication number of certain file to 10: hdfs
Read more

How to get logs of a specific time range on Linux?

Posted on

The logs I am processing is Hadoop log (log4j). It is in format like: 2014-09-20 21:55:11,855 INFO org.apache.hadoop.nfs.nfs3.IdUserGroup: Updated user map size: 36 2014-09-20 21:55:11,863 INFO org.apache.hadoop.nfs.nfs3.IdUserGroup: Updated group map size: 55 2014-09-20 22:10:11,907 INFO org.apache.hadoop.nfs.nfs3.IdUserGroup: Update cache now 2014-09-20 22:10:11,907 INFO org.apache.hadoop.nfs.nfs3.IdUserGroup: Not doing static UID/GID mapping because ‘/etc/nfs.map’ does not exist. Now, I
Read more

Making Hadoop Java process heap larger?

Posted on

In Hadoop 2.5.0, I use ‘ps -aux’ and find the Java process has options: -Xmx1000m However, my nodes have 32GB memory. How to make Hadoop Java process heap larger? In yarn-env.sh, you can find: # For setting YARN specific HEAP sizes please use this # Parameter and set appropriately # YARN_HEAPSIZE=1000 In hadoop-env.sh, you can
Read more

Finding All Available Versions of a Package in Ubuntu

Posted on

How to find all available versions of a package in Ubuntu? To list available versions of a package: apt-cache showpkg <package-name> For example, to check all versions of thunderbird: $ sudo apt-cache showpkg thunderbird Package: thunderbird Versions: 1:31.1.1+build1-0ubuntu0.14.04.1 (/var/lib/apt/lists/archive.ubuntu.com_ubuntu_dists_trusty-updates_main_binary-amd64_Packages) (/var/lib/apt/lists/security.ubuntu.com_ubuntu_dists_trusty-security_main_binary-amd64_Packages) Description Language: File: /var/lib/apt/lists/archive.ubuntu.com_ubuntu_dists_trusty_main_binary-amd64_Packages MD5: 68ed1001b79d708ad48956a0c129114d Description Language: en File: /var/lib/apt/lists/archive.ubuntu.com_ubuntu_dists_trusty_main_i18n_Translation-en MD5: 68ed1001b79d708ad48956a0c129114d 1:31.0+build1-0ubuntu0.14.04.1 (/var/lib/dpkg/status)
Read more

How to back up emails from the email server?

Posted on

offlineimap actually does “synchronization”—if an email is deleted from the server, the email will be deleted from the local Maildir (http://offlineimap.org). This is good for mail sync. However, for backing up emails,one would like to keep the email even if it is deleted from the server. How to achieve this? You may use `getmail` which
Read more

How to backup iPhone on Linux?

Posted on

How to backup iPhone on Linux? Not using iTunes or Windows. You can backup iPhone using libimobiledevice. By the time I replied here, it supports up to iOS 7. Please check this post for a tutorial. Actually, the methods introduced can be used on Linux systems with the libimobiledevice installed. Backup iPhone: idevicebackup2 backup ~/iPhoneBackups/
Read more

How to find the history of updated packages by apt-get or aptitude?

Posted on

How to find the history of updated packages by apt-get or aptitude? The history is in log files of dpkg and apt: /var/log/dpkg.log /var/log/apt/history.log /var/log/aptitude To check the recently installed packages: cat /var/log/dpkg.log | grep ” install ” To list history of recently installed packages by apt-get: cat /var/log/apt/history.log | grep ” install ” To
Read more

Wireless driver in Linux Mint for HP Mini 110 Netbook

Posted on

How to install the wireless driver in Linux Mint for HP Mini 110 Netbook? The bcmwl-kernel-source for Broadcom Linux STA driver works. You may need to connect the network first by the LAN cable or another wireless adapter. Open Control center -> System, Driver Management. After a while, it will ask you to select the
Read more

Manage Linux console screen by commands?

Posted on

How to manage Linux console screen by commands? When the screen will be blanked? Put the screen into powerdown mode or power off the screen? 2 tools are useful for managing the console screen on Linux: setterm – set terminal attributes.vbetool – run real-mode video BIOS code to alter hardware state. When the screen will
Read more

How to monitor temperatures of laptop on Linux

Posted on

How to monitor temperatures of laptop on Linux? This works on Linux Mint: sudo aptitude install lm-sensors hddtemp For lm-sensors, first detect the sensors by: sudo sensors-detect To detect the temperature in the system: sudo sensors To detect the HDD (e.g. sda) temperature: sudo hddtemp /dev/sda An example of the output: [zma@mini:~]$ sudo sensors acpitz-virtual-0
Read more