How to Increase Pageviews per Visitor?

A post quoted from http://www.webmasterworld.com/

Well, stickiness is important for some web sites; certain other types of web site don't require it.

Now, what you really need is the following guideline PDF, and then you should be sailing along perfectly:

Google search for the Matt Cutler E-Metrics white paper (about 60+ pages long)

or go here

This should help you form a better question for yourself, and then you can ask the question about stickiness.

That book is my bible. I live by its basic foundation rules.

Now, if you've got the guts (and a decent-volume web site), then you've got to try out the following steps:

1) Speed the site up to a 3-second maximum load time on a 14.4 kbps dial-up modem. Don't forget that every second lost to slow design equals some CPU that could be used for the next page load.

2) Color test: knowing what you want your user to become (read chapter 4), test colors that improve your velocity (chapter 10).

3) Test and risk: try to find the hot spots on your pages and maximise them. I had a web site that I sold off a long while ago; its hot spot was the lower left. To this day I cannot figure out why, but the conversions there were the best.

4) Don't forget to drop some money on a WebmasterWorld subscription. It's worth it.

5) Go to IncrediBILL's blog and read the entire thing. When you get to his cat issues, start laughing, and learn about those evil scrapers and nasty robots.

6) Read markus007's comments from May 2004 onwards and you'll see something interesting, i.e. every time Google makes a change in the targeting system for AdSense, at first you don't make as much as you did, but over time you catch up and surpass it.
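
To put step 1 of the quoted list in numbers, here is a rough back-of-the-envelope sketch of how small a page must be to load in 3 seconds over a 14.4 kbps modem. It assumes the full nominal line rate is available and ignores latency, protocol overhead and modem compression; the function name is made up for illustration.

```python
def page_budget_bytes(link_kbps: float, max_seconds: float) -> float:
    """Return the largest page size (in bytes) that fits the time budget."""
    # Modem rates are decimal: 1 kbps = 1000 bits per second.
    bits = link_kbps * 1000 * max_seconds
    return bits / 8  # 8 bits per byte


budget = page_budget_bytes(14.4, 3)
print(f"{budget:.0f} bytes (about {budget / 1024:.1f} KiB)")
```

Under these assumptions the whole page, images included, has to fit in roughly 5.4 KB, which shows how strict that target is.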


Testing and Practising with Valuable Free Resources

We can get a lot of free resources from the Internet, including free blog services, free software and open-source code, and especially free web-hosting services.

I recommend one free web-hosting provider here: 1majorhost.

1Majorhost.com offers one of the richest feature sets among free web-hosting services on the Internet. Its features are as follows:
  • 10 GB Storage Space
  • 99.98% Server Uptime
  • 65 GB Monthly Data Transfer
  • Linux Enterprise Operating Systems
  • FTP Supported
  • PHP 4 Supported
  • MySQL 4 Supported, 5 Databases
  • Rapid Servers with low load time
Wow, amazing, isn't it?

It also has a fully functional user control panel. You operate it through a web-based interface that shows site stats, limits, traffic, SQL and more. With it you can:

1. Manipulate databases online.
2. Edit website files in a WYSIWYG editor.
3. Upload files from your browser.
4. Install well-known web applications in a few minutes: a blog engine like WordPress, a BBS like phpBB, or a CMS like Mambo or Joomla.


Search engine optimization terminology (from IBM DW)

IBM developerWorks published a series of SEO articles. Here is the terminology from that series.

SEO terminology

Here's the terminology you'll need to get started with this series.

Directory
A directory is a human-compiled search. Most directories rely on submissions instead of spiders.
Keywords, keyterms, and keyphrases
Keywords, keyterms, and keyphrases are the words you want your Web site to rank well for in the search engine results pages, also called SERPs. Depending on your audience, yours can be one word, a combination of words, or an entire phrase. To reduce word-bloat, I'll use the term keywords to encompass all three types.
Link farm
In SEO, a link farm is a page full of links that have very little to do with each other and exist just as links without any real context. People who practice black hat SEO use link farms to increase the number of links to a page in hopes of fooling Google™ into thinking the page is more link-worthy than it actually is.
Organic listings
Organic listings are the free listings in the SERPs. SEO for organic listings usually involves improving the actual content of your Web site, often at the page or infrastructure level.
PageRank
PageRank is a measurement that the Google-obsessed use to test their rankings in Google. SEO and search engine marketing (SEM) professionals also use the term to describe your ranking in the SERPs and the ranking algorithm points given to your site by Google. No matter how you define it, PageRank is an important part of your SEO success.
Paid listing
Like the name says, paid listings are paid for in search engines. Depending on the search engine, a paid listing can mean paying for inclusion in the index, pay per click (PPC), a sponsored link, or other ways of making your site show up in the SERPs for targeted keywords and phrases.
Ranking
A ranking is where your page is listed in the SERPs for your targeted keywords. The goal of SEO is high rankings for the keywords that your Web pages target.
Ranking algorithm
A ranking algorithm is the set of rules that a search engine uses to evaluate and rank the listings in its index. The ranking algorithm is what determines which results are relevant to a specific query.
Search engine marketing (SEM)
SEM is used interchangeably with SEO, but SEM often refers more to marketing your Web site to the search engines through paid placement and ads, as well as using SEO techniques.
Search engine optimization (SEO)
SEO is creating Web pages that are picked up by the search engines through optimizing your content for search engine attractiveness and visibility. SEO is mostly used to increase the rankings of your organic listings. I'll use the term SEO to describe the techniques I recommend, although many of these techniques also fall under the umbrella of SEM.
Search engine results page (SERP)
SERPs are the listings, or results, displayed for a particular search. SERP is sometimes defined as search engine results placement. For the purposes of this series, I'll refer to it as a page rather than a placement. In the world of SEO, a good showing in the SERPs is what it's all about.
Spamming
Spamming is a method of SEO that attempts to trick a spider and scam loopholes in the ranking algorithm to influence rankings for targeted keywords. Spamming can take many forms, but the most simple definition for spam is any technique a Web site uses to misrepresent itself and influence ranking. The two methods of SEO are based on whether you want to spam or not.
  • Black hat SEO: Spamming the search engines. Black hat SEO is lying, cheating, and stealing your way to the top of the SERPs.
  • White hat SEO: Optimizing your site so it serves the user, as well as attracts spiders. In white hat SEO, anything that leads to a good user experience is considered also good for SEO.
Spider
A spider crawls through the Web looking for listings to add to a search engine index. It is sometimes referred to as a Webcrawler, robot, or bot. When optimizing your page for organic listings, you are catering to the spider.


Interview with Matt Cutts on Search and SEO in China

Note: This article is copied from http://www.chinamyhosting.com/seoblog/2007/04/10/interview-matt-cutts-en/

Original Author: ZAC

April 10, 2007

Matt Cutts mentioned in his blog on Mar 17, 2007, “I still have an email interview with a blogger that I’m trying to finish that started in September 2006”.

So here you go, it’s finished :-)

In the interview with Matt Cutts about search and SEO in China, Matt and his “top Chinese webspam engineer”, Jianfei, answered my long list of questions with great tips and insights.

It is helpful to all SEOers and online marketers.

Special thanks to Philipp.

Chinese version is here.

Zac: First of all thank you guys for doing this interview with me, I believe it will be very helpful for SEOers and web marketers in China.

There are currently lots of misunderstandings about SEO in China. The first thing that pops up in mind is “spam” when people hear the word SEO. Some say “SEO is shortsighted and is like suicide”. From search engine’s point of view, is that true? Is SEO hated, allowed or encouraged by Google? We’re talking about whitehat SEO here.

Matt: It’s a common mistake to think that search engines don’t like SEO. The fact is that SEO within Google’s quality guidelines is okay. That includes things like making sure that your site is crawlable, thinking of words that users would use when searching and including them naturally within the content of the site, and doing things like making sure that page titles and urls are descriptive.

What Google (and other search engines) don’t like is when someone tries to cheat or take a short cut to show up higher than they should. When a site violates our quality guidelines, Google calls that spam.

Zac: Google announced its official Chinese name “Gu Ge” (Harvest Song) in April 2006; however, the majority of Chinese users do not seem to like the new name.

According to the China Internet Network Information Center (CNNIC), Google’s market share has dropped from 33% last year to 25.3% now.


What do you think of the market share drop?

Jianfei (朱健飞): For the market share, let’s refer to the statement from Kaifu Lee, the president of the Google China office: “To some extent, the survey could have some errors. Different users have different frequencies of using search engines. People may use search engines 10 times a day, while other people may use search engines once a day. Simple sampling methods may not show the real traffic of different search engines.”

Zac: I noticed there are Chinese employees at Google headquarters. Any idea how many Chinese employees are at the Googleplex now? How are they doing? Any advice for Google fans who want to join Google?

Jianfei: We do have many Chinese engineers at the Googleplex. They are doing great. You can visit http://www.googlechinablog.com/ and read some Chinese engineers’ articles about their life at Google.

For Google fans who want to join Google, they can go to http://www.google.cn/jobs/ and check available jobs. If they can not join Google, they still can give us suggestions and ideas. Their support is important to us. For reporting spam sites, they can go to http://www.google.cn/contact/spamreport.html.

Matt: In fact, if you sign up for Google’s Webmaster Central at http://www.google.com/webmasters/ , you can also use the form at
to report spam. In addition, if you don’t want to sign up for Google account, you can also report spam here:

However, I recommend that you use one of the first two links. Google gives more weight to spam reports that are done with our Webmaster Central.

Zac: Let’s talk about duplicate content, which is a hot topic recently.

I see much more content copying on Chinese web sites. Many Chinese webmasters like to “gather” content from other web sites, either using software or by hand, and then publish it on their own web sites. Does Google penalize these sites full of content you can see everywhere? Is there a percentage or threshold beyond which a penalty is applied?

What should the original author do so that the original is recognized as such?

Jianfei: We have noticed that some Chinese web sites have a lot of duplicate content. Users like to get different search results, so Google is looking at how best to provide diverse results. Our algorithms already have some ways of removing duplicate content, and we will continue to look for ways to improve.

Zac: Some web sites use multiple domains with exactly the same content, for example, domain.com and domain.com.cn. Is this risky? What’s the best way to do it?

Matt: If the content is truly the same, I would pick one domain and make the other domains do a redirect to the domain you prefer. For example, google.com could do a permanent (301) redirect to www.google.com, and then we would see that and generally choose the destination of the redirect. Having content from two different domains isn’t risky if they are in different languages (for example, Chinese and English), but if you have the exact same content on two different domains, it’s better to use a permanent redirect from the duplicate domains to a single preferred domain.

If you have mirror pages without a redirect, that can cause issues. It’s better to use 301/permanent redirects, because otherwise Google might choose to remove or not show the copy that you like best.
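
As an illustration of the permanent redirect Matt describes, here is a minimal sketch using Python's standard `http.server` module. The host name `www.example.com`, the helper `redirect_target` and the handler class are hypothetical names chosen for this example; a real site would more likely configure the redirect in its web server.

```python
from http.server import BaseHTTPRequestHandler, HTTPServer
from typing import Optional

PREFERRED_HOST = "www.example.com"  # hypothetical preferred domain


def redirect_target(host: str, path: str) -> Optional[str]:
    """Return the URL to redirect to, or None if the host is already preferred."""
    hostname = host.split(":")[0].lower()  # drop any port, ignore case
    if hostname == PREFERRED_HOST:
        return None
    return f"http://{PREFERRED_HOST}{path}"


class CanonicalHostHandler(BaseHTTPRequestHandler):
    def do_GET(self):
        target = redirect_target(self.headers.get("Host", ""), self.path)
        if target is not None:
            # 301 tells browsers and crawlers the move is permanent.
            self.send_response(301)
            self.send_header("Location", target)
            self.end_headers()
            return
        self.send_response(200)
        self.send_header("Content-Type", "text/plain; charset=utf-8")
        self.end_headers()
        self.wfile.write(b"canonical content\n")


def serve(port: int = 8080) -> None:
    """Run the demo server on localhost until interrupted."""
    HTTPServer(("127.0.0.1", port), CanonicalHostHandler).serve_forever()
```

Calling `serve()` starts the demo; a request whose Host header is a duplicate domain gets a 301 pointing at the same path on the preferred domain, while requests to the preferred domain are answered directly.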

Zac: I have been talking about good original content in my blog and the message is well received by SEOers in China. However the problem is, as many readers ask me, my company sells, say a “glass edge grinding machine”, it’s simply boring, what interesting content can I write about it? Could you give some tips in content development for this type of highly specialized products?

Matt: Don’t forget that creativity can really help. For example, there was a site that made industrial blenders, which sounds like a very boring subject. But now go watch this video: http://www.youtube.com/watch?v=aM94aorYVS4 and you’ll see something amazing. They threw all kinds of different objects into the blender to prove how powerful their machine was.

It’s true that heavy machinery or industrial sites might sound boring at first, but by looking for a creative angle, you can often raise interest in your company. Even things like newsletters, blogs, information about an industry, or other resources can serve as a reason for people to get interested in your site and link to you.

Zac: The highest PR we can find on Chinese web site is PR8. Is there discrimination against Chinese sites in terms of PR? If not, why don’t we see PR9 or even better PR10 Chinese sites? Does PR still matter for ranking in the first place?

Matt: PageRank does depend on the link structure of the web, but I wouldn’t be discouraged if you don’t see PR9 and PR10 sites. For one thing, Chinese sites are usually only ranking against other similar Chinese sites, so the playing ground is level. It’s also important to remember that Google has a finer scale to measure PageRank (not just from 1 to 10), so even two different sites that both have a PR6 in the toolbar can actually have different PageRanks.

The fact is that Google does special work to help measure reputation in non-English languages.

Zac: Is there a significant difference between Chinese site SEO and English site SEO? Are there differences in your algorithm for different languages?

Matt: I think that every country does have some differences in how they do SEO. In Germany, people are more likely to use hyphens in their domain names, for example. Some countries lean more toward monetizing via affiliate programs; other countries may monetize more via cell phones than credit cards, because credit cards aren’t equally common in every country. But there are many common ways that SEOs operate.

Jianfei: One main difference between Chinese site SEO and English site SEO is the set of queries they are working on. For example, “viagra” is one of the most spammy queries for English, while “手机铃声” (ringtone) is a more spammy query for Chinese. Another difference is that almost all mid- or large-sized Chinese domains have blogs, which is not the case for other languages.

Zac: AdWords users in other countries normally sign up for an AdWords account directly with Google. However, Google takes a different approach in China, partnering with AdWords agents, a localized approach that all PPC providers in China follow.

Why did Google choose this localized approach rather than stick to the direct relationship with advertisers, which has proven successful worldwide? Do you consider your AdWords program in China a success?

Matt: I’m sorry to say that I’m not an AdWords expert, but I do know that Google tries to adapt to each market and present products in the way that works best for every country. I’m proud of the AdWords team, and I think that they’re doing a really good job in China.

Zac: Some SEOs believe that freshness plays an important role in Google ranking. Many think blogs are easier to rank better due to freshness. Yet some SEOs think it’s not a good idea to tweak web pages frequently.

What do you suggest? Update web pages often, or no?

Matt: It depends on the industry that you are in. I would do whatever makes the most sense for your users. Just changing a page more often for the sake of having a page change is probably not very productive. But if you have a blog, then posting more often might attract more users. So for some people it might make sense to change the page less often (a manufacturing company, for example), while for some people it will make sense to change the page often to attract more visitors (e.g. if you are a blogger).

Zac: Is SEO service a reliable business model that you would recommend to SEOers in China? I ask this because there are very few established and reputable SEO companies in China. Many companies claim they provide SEO services, but what they actually do is spam forums and blogs.

Ethical individual SEOs are struggling to survive.

I believe you know plenty of successful SEO companies. In China, do we have a future ahead of us in the SEO industry? How can we grow from individual SEOs into reputable SEO companies?

Jianfei: I think if SEOs follow Google’s quality guidelines, then they can have a great future. Search engine results are important to the industry, and there can be a lot of market demand for an ethical company.

Matt: If you are considering using an SEO, it’s very important to think about the long-term. It doesn’t help to get a spike in users if Google or other search engines will find spam and remove a site. One thing you can do is ask for references or see if a company can provide success stories where the SEO provided stable long-term traffic. http://www.google.com/support/webmasters/bin/answer.py?answer=35291&hl=cn is a good document to read about how to research SEOs. Unfortunately, there are some SEO services that will spam if you hire them, and you should try to avoid them in the first place.

Zac: There’s lots of talk about trusted domains and authority sites. If a site is considered an authority, it will be ranked higher in Google, more people find it and link to it, and then it becomes even stronger.

How should mom-and-pop sites overcome this situation and compete with authority sites? Besides building a great site with tons of useful original content, is there a shortcut?

Matt: I wouldn’t try to tackle a huge keyword if you’ve just created your small mom-and-pop business. Instead, concentrate on a smaller niche where you can get to be known as an expert. As you get to be more well-known, then you can work from the smaller niche up to bigger and bigger areas. Many successful sites start out small and then build their way up. Also, the more creative or funny or helpful you can be, often that will help people become aware of you faster.

Jianfei: For example, your site http://www.chinamyhosting.com/seoblog/ , is such a site. One year ago, the site was not as well-known. But through your hard work and creative effort, now it ranks well for the query [搜索引擎优化排名] (search engine optimization) which is an impressive feat.

Zac: Have you ever been to China? If you do plan to visit China, there’re thousands of fans who would like to meet you in person. :)

Jianfei: I was born in China, in the last year I’ve been to China twice, plus I enjoy working with people at the Google China office.

Matt: I’m sorry to say that I’ve never been to China. My mother has been to China several times, including Yangshuo (Guilin), and my wife has been once, and they both speak a little Chinese even though they’re both American. So clearly I need to work on getting over there; I hear that it’s an amazing country, so I’d love to visit some day.

I won’t be able to make it to SES China this year, but I’m really excited that I think Jianfei or another Google representative will be able to represent Google at SES China. Jianfei is a top-notch member of the webspam team and he’s a much smarter expert on Chinese webspam than I am. :)

Zac: Every day I see link spam in my blog. Will link spam in blogs and forums cause a penalty, or is it simply ignored by Google and therefore has no effect on ranking?

Jianfei: Actually, it can be dangerous to do link spam. If Google finds a company is doing link spam, it may remove the company’s site from our index. Google may not re-include the site until we no longer see the spam links. In most cases, removing links is even more difficult than adding links (e.g., the links posted on blogs and BBSes by spamware), so it’s better to stay away from link spam.

Matt: Usually Google is good enough that we just try to ignore link spam. When we can tell that a company did link spam, we can take appropriate action.

Zac: Another topic in all SEO forums and blogs is the supplemental results. If more and more pages of a domain are dumped into the supplemental results, does it mean the domain is losing trust? Would you worry about supplemental results if you were an SEOer instead of a Googler?

Matt: I wouldn’t worry about supplemental results. If your site has lower PageRank then it may occur in our supplemental index, but that doesn’t mean that the site has a penalty or is losing trust. Usually that just means that if you get a few more high-quality links because your site is good, then we will include more pages from your site in our main web index.

In addition, we have been getting better at refreshing our supplemental index more often and showing those results to more users, so webmasters can often start to see more traffic coming to supplemental results pages now.

Zac: Baidu is your biggest competitor in the Chinese search market. It’s said that they have better search technology than Google in certain fields, such as Chinese word segmentation.

On the other hand, Google has been recruiting top talent in China. I read somewhere that the engineering team at Google China has yet to contribute much to the core ranking algorithm. Do you plan to localize the algorithm to better suit the Chinese language? What’s your technical advantage compared with Baidu?

Jianfei: As a matter of policy, we don’t comment on specific competitors. We welcome competition that helps deliver useful information to users and expands user choice. Having great competitors is a huge benefit to us and everyone in the search space - it makes us all work harder and at the end of the day our users benefit from that.

Matt: We don’t talk much about our ranking because it’s confidential, but the China office has contributed in several ways to how Google does ranking. In fact, some really nice applications such as http://www.google.cn/rebang/home are seen in China before other places. That’s a brand-new product developed in China.

Zac: If you don’t mind, Matt, are you GoogleGuy at WebmasterWorld, as hinted? Google is doing a great job communicating with webmasters and we appreciate it. Is there any chance that an engineer on the Google China team can take a similar role and communicate actively with the Chinese webmaster community?

Matt: I don’t think we’ve confirmed the official identity of GoogleGuy, and that’s okay because it means that if GoogleGuy ever needs to take a break, someone new can come in to help communicate. The truth is that I get more credit than I deserve. A lot of communication in English happens from a lot of people: Vanessa Fox, Adam Lasnik, and many, many others.

And in Chinese, I’m very lucky to work with a great team of people such as Jianfei, plus other wonderful people in Mountain View (California USA) and Beijing. My guess is that over time, Google will begin to communicate more and more with Chinese webmasters. This joint interview is a good step forward.

Zac: There are debates in China about what role SEO should play in the bigger picture for web sites. Is SEO an important part of web marketing and ecommerce? Some web marketers think SEO is a piece of cake: write a title tag, add keywords here and there, things like that.

Do you think SEO has nowadays gone one step further and acts as a kind of web marketing consulting? In other words, SEOers should help clients streamline the online sales process, find their target market, and work on content development, user experience, viral marketing, etc. This is the concept I’m trying to spread. In the end, users need a great site, not great code.

Matt: I agree that SEO in many cases is about making a great site, not just getting the web design or the code just right. SEO does include getting the right tags and code in place, but that’s just the first step. If you can come up with a great viral marketing campaign or something that gets people talking about your site with word of mouth, that’s SEO as well, and is a much better way to get links than trying to use spamware programs, for example.

In many ways, SEO is about making sure that users have a great experience, because if you make a great site, that’s going to help a site rank better in search engines naturally.

Zac: Do you foresee big changes in terms of SEO in the coming few years?

Matt: I think personalization and localization are big trends. If we can return different results for the same query because Google knows a little more about you, that may be a really big quality win for users. That will make SEO a little harder, but SEOs who care about long-term value will be quite happy about personalization, because they’ll get visitors who are more interested in their site, and those visitors may convert into buyers.

Jianfei: Of course, Google is also going to continue to pay a lot of attention to quality and SEO. Over time, I think Chinese SEOs will find that it’s easier to make great sites that agree with our quality guidelines, because Google will continue to work hard to stop spam.

Zac: Do you like Chinese food?

Matt: I love Chinese food! I hear that Chinese food in the United States isn’t quite the same though, so maybe some day I’ll get a chance to experience real Chinese food. I’d like to try some Peking duck, for example. Thank you for asking these interesting questions!

Zac: Thank you Matt and Jianfei.

Matt: Thank you! We’d love for people to report Chinese spam at
and to use our webmaster tools as well. There’s also a lot of information for Chinese webmasters at http://www.google.cn/support/webmasters/ . We’ll also continue to listen to Chinese webmasters and try to respond.

We enjoy reading sites like http://www.seobbs.net/ , http://www.dunsh.org/ and of course we also enjoy http://www.chinamyhosting.com/seoblog/ :)


Top Information Retrieval Related Components for the Chinese Language

Recently, the Chinese lexical analysis system ICTCLAS (Institute of Computing Technology, Chinese Lexical Analysis System) was updated to version 3.0. The functions of ICTCLAS include word segmentation, part-of-speech tagging, unknown word recognition and user-defined dictionaries.

"With ICTCLAS 3.0, the word segmentation speed can reach 996 KB/s and the precision can reach 98.45%, yet the API package is only 200 KB. After compression, all the dictionaries together take only 3 MB. It must be the best Chinese lexical analysis system", quoted from the official website of the component.

You can integrate the API into your Java, C#, or C/C++ applications. The downloaded package contains samples in these languages for both Linux and Windows.
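
To show what word segmentation means in practice, here is a toy sketch of forward maximum matching, a simple dictionary-based baseline. It is not the ICTCLAS API and not how ICTCLAS works internally; the dictionary and the function name are hypothetical, chosen only for illustration.

```python
from typing import List, Set

# Tiny hypothetical dictionary, for illustration only.
TOY_DICTIONARY: Set[str] = {"搜索", "引擎", "搜索引擎", "优化", "排名"}


def forward_max_match(text: str, dictionary: Set[str], max_len: int = 4) -> List[str]:
    """Greedy left-to-right segmentation: always take the longest dictionary word."""
    words = []
    i = 0
    while i < len(text):
        # Try the longest candidate first, then shrink.
        # A single character is always accepted as a fallback.
        for size in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + size]
            if size == 1 or candidate in dictionary:
                words.append(candidate)
                i += size
                break
    return words


print(forward_max_match("搜索引擎优化排名", TOY_DICTIONARY))
# prints ['搜索引擎', '优化', '排名']
```

A real analyzer also has to resolve ambiguous splits, tag parts of speech and recognize words missing from the dictionary, which is where a system like ICTCLAS goes far beyond this baseline.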

The trial version of the package can be downloaded from the official site. The site also provides some other Chinese language processing tools for download. One that interests me is a component for extracting the body text of web pages.

Finally, here is the website address: http://www.i3s.ac.cn
By the way, i3s stands for "Division of Information Intelligence and Information Security".


Build Lucene 2.1.0 from Scratch in Eclipse Under Windows

Main steps:
1、Download the JDK from http://java.sun.com/
2、Install the JDK.
3、Download the Lucene 2.1.0 source code from its official site: http://lucene.apache.org/
4、Extract the files from the archive to D:\test\lucene-2.1.0 (I chose this directory only for demonstration; you can use any directory you like, but you must make the corresponding changes in the steps below).

5、Get Eclipse from http://www.eclipse.org/
and extract it to D:\Eclipse

6、Launch Eclipse, then click the menus and buttons as follows:
File=>New=>Project... (select Java->Java Project, then click Next)=>select the option "Create project from existing source" and press the Browse button to select the project directory. I set it to D:\test\lucene-2.1.0 and set the project name to "lucene-2.1.0". Click Next=>Finish.
7、You will now have a new project named "lucene-2.1.0" in your Package Explorer.
The Eclipse IDE will build the Java files automatically, and there may be some errors. Don't worry about them; just continue.
8、Select the project "lucene-2.1.0" in your workspace, then click the menus and buttons as follows:
Run=>External Tools=>External Tools...
The wizard has already configured the build task for you.
You can see a node named "lucene" under Ant Build.
Press Run to launch the build process.
In the end, I got the same messages in the Eclipse Console as I showed in the previous article on building under Linux.

9、The build may fail. If it does, check the JAVA_HOME environment variable. It must point to your Java Development Kit directory; if it does not, set it.
Alternatively, change the JRE that Ant uses in the External Tools wizard. I did not encounter this problem, so I do not show how to solve it in detail. If you have any questions, feel free to contact me or comment on this article.
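
For the JAVA_HOME check in step 9, a small script can confirm that the variable points at a JDK rather than a plain JRE. This is only a sketch: it assumes that a JDK can be recognized by a `javac` (or `javac.exe`) executable in its `bin` directory, and the function name is made up for illustration.

```python
import os
from pathlib import Path
from typing import Mapping, Optional


def find_javac(env: Mapping[str, str]) -> Optional[Path]:
    """Return the path to javac under JAVA_HOME, or None if it is not a JDK."""
    java_home = env.get("JAVA_HOME")
    if not java_home:
        return None  # variable unset or empty
    for name in ("javac", "javac.exe"):
        candidate = Path(java_home) / "bin" / name
        if candidate.is_file():
            return candidate
    return None


javac = find_javac(os.environ)
if javac:
    print("JAVA_HOME looks like a JDK:", javac)
else:
    print("JAVA_HOME is unset or does not point to a JDK")
```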

Build Lucene 2.1.0 from scratch under Linux

In this Linux version, all operations are executed from the command line.
I will also give a Windows version of the build.
The build process on Windows can be completed from the command line just as on Linux, but there I will show how to get the same result using Eclipse and the Ant integrated with Eclipse.

Main steps:
1、Download the JDK from http://java.sun.com/
2、Install the JDK.
3、Download the Lucene 2.1.0 source code from its official site: http://lucene.apache.org/
4、Extract the files from the archive:
tar -zxvf lucene-2.1.0.tar.gz

5、Get the build tool Ant from http://ant.apache.org/

If you have an old version of Ant (before 1.6.2, such as 1.5.2-23), the build will fail with this error message:
file:/usr/home/test/code/lucene-2.1.0/build.xml:7: Unexpected element "import"
Use "ant -version" to check your Ant version.
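
The version check can also be scripted. The sketch below parses the text printed by `ant -version` and compares it with the 1.6.2 minimum mentioned above. The sample line is an assumption about the usual output format, and the helper names are made up for illustration.

```python
import re
from typing import Optional, Tuple

MIN_ANT = (1, 6, 2)  # per the text above, older versions fail on <import>


def parse_ant_version(output: str) -> Optional[Tuple[int, ...]]:
    """Extract the dotted version number that follows the word 'version'."""
    match = re.search(r"version (\d+(?:\.\d+)*)", output)
    if match is None:
        return None
    return tuple(int(part) for part in match.group(1).split("."))


def ant_is_new_enough(output: str) -> bool:
    """True if the reported version is at least MIN_ANT."""
    version = parse_ant_version(output)
    return version is not None and version >= MIN_ANT


# Assumed sample of what `ant -version` prints.
sample = "Apache Ant version 1.7.0 compiled on December 13 2006"
print(parse_ant_version(sample), ant_is_new_enough(sample))
```

To check a real installation, the output could be captured with `subprocess.run(["ant", "-version"], capture_output=True, text=True).stdout` and passed to `ant_is_new_enough`.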

The latest Ant version is 1.7.0. I downloaded the zip file and will use it to show how to finish the remaining build steps.
wget http://apache.mirrors.redwire.net/ant/binaries/apache-ant-1.7.0-bin.zip

Now it's time to build! Change to the Lucene directory: /usr/home/test/code/lucene-2.1.0
In this example, Ant is under the code directory. Use the command:

If there are no errors, you will get the following messages:
Buildfile: build.xml





[echo] Clover not found. Code coverage reports disabled.


[mkdir] Created dir: /usr/home/test/code/lucene-2.1.0/build/classes/java
[javac] Compiling 204 source files to /usr/home/test/code/lucene-2.1.0/build/classes/java
[javac] Note: * uses or overrides a deprecated API.
[javac] Note: Recompile with -Xlint:deprecation for details.

[rmic] RMI Compiling 1 class to /usr/home/test/code/lucene-2.1.0/build/classes/java

[jar] Building jar: /usr/home/test/code/lucene-2.1.0/build/lucene-core-2.1.1-dev.jar


Total time: 10 seconds

Easy? Got it? Haha.