[mono-vb] WebCrawler in vb.net (mono)

Rafael Teixeira monoman at gmail.com
Fri Feb 19 06:20:09 EST 2010


There's already a Lucene port to .NET: http://lucene.apache.org/lucene.net/

Although you are posting to the mono-vb list, I personally wouldn't use or
recommend VB.NET for that (Mono's VB compiler is only compatible with
VB.NET 2.0 [VB8]).
Try C# 4.0 (the dmcs compiler on Mono) or Boo as the main language (your arms,
wrists and fingers will thank you later).

:)

Rafael "Monoman" Teixeira
---------------------------------------
"To be creative means to be in love with life. You can be creative only if
you love life enough that you want to enhance its beauty, you want to bring
a little more music to it, a little more poetry to it, a little more dance
to it."
Osho


2010/2/18 quandary <quandary82 at hailmail.net>

> I've wanted to do that for a long time.
>
> You can take a look at Apache Lucene, a Java search library, which you
> could port to .NET.
> Perhaps you can find a way to compile the Lucene library from Java
> source/bytecode directly to .NET.
>
> Another way is to extend this CodeProject project:
> http://www.codeproject.com/KB/IP/Crawler.aspx
>
> Then you need a ranking algorithm, such as Google PageRank, or perhaps
> better, something like Yahoo TrustRank, plus a parallel computation library
> and cluster software for computing the eigenvectors of the Markov chains
> (indexing).
>
> I found this site about PageRank to be particularly useful because of its
> incredible simplicity:
> http://www.peterbe.com/PageRank-in-Python
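
To make the ranking step above a bit more concrete, here is a minimal sketch of PageRank by power iteration in plain Python. It is not taken from the linked page; the function name `pagerank`, the `links` dictionary format, and the defaults (damping 0.85, 50 iterations) are assumptions chosen for illustration. For a real index you would want a sparse-matrix or distributed version.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Approximate PageRank by power iteration.

    links maps each page to the list of pages it links to
    (hypothetical input format, chosen for this sketch).
    """
    # Collect every page that appears as a source or as a target.
    pages = set(links)
    for targets in links.values():
        pages.update(targets)
    pages = sorted(pages)
    n = len(pages)

    # Start from a uniform distribution over all pages.
    rank = {p: 1.0 / n for p in pages}

    for _ in range(iterations):
        # Rank held by pages without outgoing links is spread evenly.
        dangling = sum(rank[p] for p in pages if not links.get(p))
        new_rank = {p: (1.0 - damping) / n + damping * dangling / n
                    for p in pages}
        # Every other page splits its rank among its link targets.
        for page, targets in links.items():
            if targets:
                share = damping * rank[page] / len(targets)
                for target in targets:
                    new_rank[target] += share
        rank = new_rank
    return rank


if __name__ == "__main__":
    graph = {"a": ["b"], "b": ["a"], "c": ["a"]}
    for page, score in sorted(pagerank(graph).items()):
        print(page, round(score, 4))
```

The ranks always sum to 1, so they can be read as the stationary distribution of the "random surfer" Markov chain, which is the eigenvector the quoted text refers to.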
>
>
>
> On 02/17/2010 03:21 PM, Mauro Risonho de Paula Assumpção wrote:
>
> I am developing open source software that needs a web crawler. I would
> like help from the list. The idea is to scan the structure of a site
> (HTTP and HTTPS) and build it up in a TreeView in VB.NET with GTK (Mono). Does
> anyone have any ideas?
>
> Thanks
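
As a rough illustration of the idea in the question (scan one site and keep its structure as a tree), here is a small breadth-first crawler sketch using only the Python standard library. The names `LinkParser`, `fetch_html`, and `crawl`, the `max_pages` limit, and the dictionary-of-children result are all assumptions made for this example, not an existing API. The same structure should carry over to VB.NET or C#, with the resulting dictionary used to fill the tree widget.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin, urlparse
from urllib.request import urlopen


class LinkParser(HTMLParser):
    """Collects the href values of <a> tags."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def fetch_html(url):
    """Download a page over HTTP or HTTPS and return its text."""
    with urlopen(url, timeout=10) as response:
        return response.read().decode("utf-8", errors="replace")


def crawl(start_url, fetch=fetch_html, max_pages=50):
    """Breadth-first crawl of one host; returns {url: [child urls]}."""
    host = urlparse(start_url).netloc
    tree = {}
    seen = {start_url}
    queue = deque([start_url])
    while queue and len(tree) < max_pages:
        url = queue.popleft()
        try:
            html = fetch(url)
        except Exception:
            tree[url] = []  # unreachable page: keep it as a leaf
            continue
        parser = LinkParser()
        parser.feed(html)
        children = []
        for href in parser.hrefs:
            # Resolve relative links and drop "#fragment" parts.
            child = urldefrag(urljoin(url, href)).url
            parts = urlparse(child)
            same_site = parts.scheme in ("http", "https") and parts.netloc == host
            if same_site and child not in seen:
                seen.add(child)
                children.append(child)
                queue.append(child)
        tree[url] = children
    return tree
```

Each URL is attached only to the first page that links to it, so the result is a proper tree and can be walked recursively to create the nodes. Passing a custom `fetch` function makes it possible to try the logic against canned HTML without touching the network.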
>
>
>
> _______________________________________________
> Mono-vb mailing list
> Mono-vb at lists.ximian.com
> http://lists.ximian.com/mailman/listinfo/mono-vb
>
>