|
|
|
|
|
|
|
|
|
Home > Latest Press Releases > [Software Release] Link Extractor - Open Source |
[Software Release] Link Extractor - Open Source |
|
|

Download LinkExtractor_executables.zip - 194.5 KB
Download LinkExtractor_project.zip - 136.7 KB
Link Extractor Introduction
Link Extractor is a program that extracts all the links from a web page you specify and lists them in left-bottom-pane. And this program shows the source of the webpage to you in right pane.
Nowadays, Link Extractor is usually used as one of SEO tools in the Internet. I hope this tool shoulde help you when your webpages go to SEO. Link Extractor is a freeware.
Using the code in Link Extractor
To get HTML Document, GetHtmlDocument() is used.
m_pHTMLDocument3 = GetHtmlDocument();
m_pHTMLDocument3 = GetHtmlDocument();
((CMainFrame*)AfxGetMainWnd())->GetLinkListView()->GetListCtrl().DeleteAllItems();
m_pCollection = m_pHTMLDocument3->getElementsByTagName(L"A");
for(long i=0; ilength; i++)
{
m_pElement = m_pCollection->item(i, (long)0);
if(m_pElement != NULL)
{
//second parameter says that you want to get text inside attribute as is
BSTR bstr; CString str;
m_pElement->get_innerText( &bstr );
str = bstr;
((CMainFrame*)AfxGetMainWnd())->GetLinkListView()->AddLink(
(LPCTSTR)bstr_t(m_pElement->getAttribute("href", 2)),
(LPCTSTR)bstr_t(m_pElement->getAttribute("title", 2)),
str.operator LPCTSTR()
);
}
To get the html source from a web page
IHTMLDocument2 *lpHtmlDocument = NULL;
LPDISPATCH lpDispatch = NULL;
lpDispatch = GetHtmlDocument();
if(!lpDispatch)
return false;
lpDispatch->QueryInterface(IID_IHTMLDocument2, (void**)&lpHtmlDocument);
ASSERT(lpHtmlDocument);
lpDispatch->Release();
IHTMLElement *lpBodyElm;
IHTMLElement *lpParentElm;
lpHtmlDocument->get_body(&lpBodyElm);
ASSERT(lpBodyElm);
lpHtmlDocument->Release();
// the parent of Body is HTML
lpBodyElm->get_parentElement(&lpParentElm);
ASSERT(lpParentElm);
BSTR bstr;
CString str;
lpParentElm->get_outerHTML(&bstr);
str = bstr;
((CMainFrame*) AfxGetMainWnd())->GetHttpSourceView()->AddSource( str );
lpParentElm->Release();
lpBodyElm->Release();
Points of Interest
Anybody can extract all website links from a web page.
Source : http://www.softahead.com
Keywords : Link Extractor, SEO tool, Link, Html Source Viewer, Html Source
|
|
|
Latest Press Releases
[View Press Releases List]
|
Lose weight without stuggling.
Date : 2009-06-20 08:22:45
This is the worlds miracle berry. Include Acai Berry in your diet and start your Miracles today. Get your Free Trial Now! More info here - http://www.sfanyy.cn
|
Feel great with Acai Flush.
Date : 2009-06-18 18:42:08
Loaded with Antioxidants Acai berry will make your body a fat burning factory. Get your Free Trial Now! More info here - http://www.sfanyy.cn
|
WooSnap Does What Yahoo and Google Cant
Date : 2009-06-04 03:33:40
New web searching tool allows users to choose multiple sites and search them with a single click.
Manassas, VA (PRWEB) May 20, 2009 -- WooSnap 2.0, a new Internet search control product developed by Bid Solve Inc., allows the user to choose multiple sites and search them for key words with a single click, something that no other browser does. W...
|
Aspose.Newsletter: Convert Excel file to PDF using Aspose.Cells
Date : 2009-06-03 03:32:52
Aspose publishes June 2009 Newsletter for .NET & Java programmers by highlighting the newly supported features by Aspose.Pdf.Kit, Aspose.Editor, Aspose.Words and Aspose.BarCode for Reporting Services. We will also provide information about migrating your code for MS Office Automation to Aspose using Aspose.Cells. You will learn about the latest new...
|
Export Word to PDF docs compliant to PDF/A-1b via Aspose.Words
Date : 2009-05-23 05:40:22
What’s new in this release?
Earlier, Aspose.Words for .NET version 6.3 supported glossary documents while loading or saving Office Open XML only. But the newer version 6.4 enables exporting Word documents to PDF compliant to PDF/A-1b and Aspose.Words team is also committed to support PDF/A-1a in the next few months as well.
Example: How to ex...
|
Localize Aspose.AdHoc to any language using Custom Localization
Date : 2009-05-11 18:00:01
What’s new in this release?
By default AdHoc is translated to three languages: English, Arabic and Dutch (Nederland). Now this version enables you to create custom localization resource sets based on a template provided in Downloads section of the web site.
More about Aspose.AdHoc for .NET
Aspose.AdHoc is an ad hoc query builder component ...
|
|
|
|
|
|
|
|