[mono-vb] WebCrawler in vb.net (mono)

Rafael Teixeira monoman at gmail.com
Fri Feb 19 06:20:09 EST 2010

There's already a Lucene port to .NET: http://lucene.apache.org/lucene.net/

Although you are posting on the mono-vb list, I personally wouldn't use or
recommend VB.NET for that (Mono's VB compiler is VB.NET 2.0 [VB8] compatible).
Try C# 4.0 (the dmcs compiler on Mono) or Boo as the main language (your arms,
wrists and fingers will thank you later).


Rafael "Monoman" Teixeira
"To be creative means to be in love with life. You can be creative only if
you love life enough that you want to enhance its beauty, you want to bring
a little more music to it, a little more poetry to it, a little more dance
to it."

2010/2/18 quandary <quandary82 at hailmail.net>

> I've wanted to do that for a long time.
> You can take a look at Apache Lucene, a Java search library, which you
> could port to .NET.
> Perhaps you can find a way to compile the Lucene library from Java
> source/bytecode directly to .NET.
> Another way is to extend this codeproject project:
> http://www.codeproject.com/KB/IP/Crawler.aspx
> Then you need a ranking algorithm, such as Google PageRank or, perhaps
> better, something like Yahoo TrustRank, plus a parallel computation library
> and cluster software for computing the eigenvectors of the Markov chains
> (indexing).
> I found this site about PageRank to be particularly useful because of its
> incredible simplicity:
> http://www.peterbe.com/PageRank-in-Python
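
The ranking step mentioned above can be illustrated on a single machine before
any cluster is involved. What follows is only a minimal sketch of PageRank by
power iteration, written in Python to match the linked article; the function
name pagerank, the damping value 0.85, the tolerance and the tiny example graph
are all assumptions chosen for illustration, not anything taken from Lucene or
from the linked page.

```python
def pagerank(links, damping=0.85, tol=1e-10, max_iter=200):
    """Approximate PageRank by power iteration.

    links maps each page to the list of pages it links to (hypothetical input
    format). The result approximates the stationary distribution of the
    "random surfer" Markov chain, i.e. its principal eigenvector.
    """
    pages = sorted(set(links) | {t for targets in links.values() for t in targets})
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}
    for _ in range(max_iter):
        # Rank held by pages without outgoing links is spread over all pages.
        dangling = sum(rank[p] for p in pages if not links.get(p))
        new = {p: (1.0 - damping) / n + damping * dangling / n for p in pages}
        for p in pages:
            targets = links.get(p, [])
            for t in targets:
                # Each page shares its current rank equally among its links.
                new[t] += damping * rank[p] / len(targets)
        change = sum(abs(new[p] - rank[p]) for p in pages)
        rank = new
        if change < tol:
            break
    return rank


# Illustrative graph: "d" has no incoming links, "c" has the most.
graph = {"a": ["b", "c"], "b": ["c"], "c": ["a"], "d": ["c"]}
ranks = pagerank(graph)
```

For a real index the link graph would be far too large for plain dictionaries,
which is where the parallel computation library comes in; the iteration itself
stays the same.
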
> On 02/17/2010 03:21 PM, Mauro Risonho de Paula Assumpção wrote:
> I am developing open source software that needs a web crawler, and I would
> like help from the list. The idea is to scan the structure of a site
> (HTTP and HTTPS) and display it in a TreeView in VB.NET with GTK (Mono). Does
> anyone have any ideas?
> Thanks
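
The core of the request (find the links on a page, keep the ones on the same
site, arrange them as a tree) can be sketched without any networking. This is
a rough illustration in Python using only the standard library; the names
LinkExtractor, same_site_links and add_to_tree, and the example.com URLs, are
hypothetical, and fetching pages and filling the GTK tree view are left out.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class LinkExtractor(HTMLParser):
    """Collects the href of every <a> tag, resolved against base_url."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(urljoin(self.base_url, href))


def same_site_links(html, base_url):
    """Return the HTTP/HTTPS links in html that stay on base_url's host."""
    parser = LinkExtractor(base_url)
    parser.feed(html)
    host = urlparse(base_url).netloc
    return [u for u in parser.links
            if urlparse(u).scheme in ("http", "https")
            and urlparse(u).netloc == host]


def add_to_tree(tree, url):
    """Insert the path segments of url into a nested dict, one level each."""
    node = tree
    for segment in [s for s in urlparse(url).path.split("/") if s]:
        node = node.setdefault(segment, {})
    return tree


# Illustrative page: two internal links, one external link, one mailto link.
html = ('<a href="/docs/intro.html">Intro</a> <a href="docs/api/">API</a> '
        '<a href="http://other.example/x">Ext</a> '
        '<a href="mailto:me@example.com">Mail</a>')
tree = {}
for link in same_site_links(html, "https://example.com/"):
    add_to_tree(tree, link)
```

The nested dictionary maps naturally onto a tree view: each key becomes a row
and each inner dictionary becomes that row's children. A full crawler would
also keep a set of visited URLs and a queue of pages still to fetch.
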
> _______________________________________________
> Mono-vb mailing list
> Mono-vb at lists.ximian.com
> http://lists.ximian.com/mailman/listinfo/mono-vb