Create and Manage Virtual Machines on Xen

Posted on

In this post, these content are introduced: Create and manage file-backed virtual block device (VBD) for virtual machines on xen. Install Fedora 11 via internet as DomU on top of xen. Manage virtual machines using xm. Create file-backed VBD: The actual space of VBD will be the amount of disk the virtual machine used. And
Read more

Creating and Running Virtual Machine Using VMware Player and qemu-img

Posted on

基于VMware Player, qemu-img创建和运行虚拟机 Linux系统上的免费个人虚拟机方案. 1. 创建虚拟硬盘映象文件: qemu-img create -f vmdk WindowsXPPro.vmdk 10G 2. 创建.vmx虚拟机配置文件: 这是一文本文件. WindowsXPPro.vmx内容: config.version = “8” virtualHW.version = “3” ide0:0.present = “TRUE” ide0:0.filename = “WindowsXPPro.vmdk” memsize = “256” MemAllowAutoScaleDown = “FALSE” ide1:0.present = “TRUE” ide1:0.fileName = “auto detect” ide1:0.deviceType = “cdrom-raw” ide1:0.autodetect = “TRUE” floppy0.present = “FALSE” ethernet0.present = “TRUE” usb.present =
Read more

Moved back to WordPress from MediaWiki

Posted on

WordPress is so missed for many great features and plugins. Hence, we moved the site back to the WordPress system on Jul. 12, 2013. MediaWiki is great but, for this site, WordPress is a better solution. The so missed features of WordPress Related posts via the YARPP Plugin. URLs without strongly mapped to the title
Read more

How to Run a cron Job Every Two Weeks / Months / Days

Posted on

We may want to run some jobs for every two weeks/months/days… under some situation such as backing up for every other week. In addition, we may add more complex rules for running jobs, e.g. run a command when the load of the server is higher than a certain level. With the help of the shell
Read more

How to Compress/Uncompress Files in Linux Using gzip, bzip2, 7z, rar and zip

Posted on

Compress/uncompress files are frequent operations. The normal tools for compressing/uncompressing in Linux is gzip, bzip2, 7z, rar and zip. This post introduces how to compress and uncompress file in Linux using these tools. We use best compressing rate with all these tools and mark the options for “best rate” in bold fonts. We can delete
Read more

Hadoop TeraSort Benchmark

Posted on

TeraSort is one of Hadoop’s widely used benchmarks. Hadoop’s distribution contains both the input generator and sorting implementations: the TeraGen generates the input and TeraSort conducts the sorting. Here, we provide a short tutorial for using the Hadoop TeraSort benchmark. TeraGen generates random data that can be used as input data for a subsequent running
Read more

Reading List for Distributed Systems and Cloud Computing

Posted on

Understanding the literature is usually the first step to do research, which is the same for systems research on cloud computing. A reading list may help a lot to those that just start in cloud computing research. Prof. Lin Gu, my PhD supervisor, compiled a reading list for system research on cloud computing. The reading
Read more

Setting Up a Git Server Using Gitosis

Posted on

Update: Since gitosis is not maintained and supported, please check out gitolite for setting up a new git server. (see the comment from Sitaram Chamarty, the gitolite author, the author of gitolite.) Gitosis is a piece of software writen by Tommi Virtanen for hosting git repositories. It manages multiple repositories under the same user account.
Read more

Hadoop Default Ports

Posted on

Hadoop’s namenode and datanodes expose a bunch of TCP ports used by Hadoop’s daemons to communicate to each other or listen directly to users’ requests. These ports information are needed by both the Hadoop users and cluster administrators to write programs or configure firewalls/gateways accordingly. A post written by Philip Zeyliger from Cloudera’s blog summarizes the
Read more

A Simple Sort Benchmark on Hadoop

Posted on

After [[hadoop-installation-tutorial|installing Hadoop]], we usually run some benchmark programs to test whether the system works well. In the post of the Hadoop install tutorial, we show a very simple to grep strings from a simple sets of files. In this post, we introduce the Sort for testing and benchmarking Hadoop. The Sort program is also
Read more

Pitfalls and Lessons on Configuing and Tuning Hadoop

Posted on

This post lists pitfalls and lessons learning when configuring and tuning Hadoop. Hadoop with IPv6 Hadoo doesn’t support IPv6 currently (up to 0.20.2 and 0.21.0): Hadoop and IPv6. The performance of the cluster may suffer from turning IPv6 on in clusters: mail archive. One good practice is to disable IPv6 on servers in the Hadoop
Read more

Setting Up Standalone (Local) Hadoop

Posted on

Hadoop is designed to run on [[hadoop-installation-tutorial|hundreds to thousands of computers]] inside cluster. However, Hadoop is configured to run things in a non-distributed mode as a single Java process by default. This is specially useful for debugging since distributed debugging is really a nightmare. This post introduces how to set up a standalone Hadoop environment.
Read more