<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Tech &#124; Data &#124; Thoughts - azeem.fm</title>
	<atom:link href="http://www.azeem.fm/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.azeem.fm</link>
	<description>by Azeem Ansar</description>
	<lastBuildDate>Thu, 06 May 2010 15:46:15 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0-beta1</generator>
		<item>
		<title>PyQuery over BeautifulSoup</title>
		<link>http://www.azeem.fm/2010/05/05/pyquery-over-beautifulsoup/</link>
		<comments>http://www.azeem.fm/2010/05/05/pyquery-over-beautifulsoup/#comments</comments>
		<pubDate>Wed, 05 May 2010 21:50:22 +0000</pubDate>
		<dc:creator>Azeem Ansar</dc:creator>
				<category><![CDATA[Code]]></category>
		<category><![CDATA[code]]></category>
		<category><![CDATA[python]]></category>
		<category><![CDATA[scraping]]></category>

		<guid isPermaLink="false">http://www.azeem.fm/?p=46</guid>
		<description><![CDATA[I was trying to scrape some search data the other day, and I ran into some malformed tag issues with BeautifulSoup, which is what I&#8217;ve used in the past. I didn&#8217;t really need much &#8220;power-scraping&#8221; though, just wanted to collect text from div tags across a few pages of search results. Also, I think BeautifulSoup [...]]]></description>
			<content:encoded><![CDATA[<p>I was trying to scrape some search data the other day, and I ran into some malformed tag issues with <a title="BeautifulSoup" href="http://www.crummy.com/software/BeautifulSoup/">BeautifulSoup</a>, which is what I&#8217;ve used in the past. I didn&#8217;t really need much &#8220;power-scraping&#8221; though, just wanted to collect text from div tags across a few pages of search results. Also, I think BeautifulSoup isn&#8217;t maintained much anymore, so I started looking for something different.</p>
<p>Then I found <a title="PyQuery" href="http://pyquery.org/">PyQuery</a>, which is great; I guess I&#8217;m new to it. For simple scraping, it&#8217;s a great way to go; the point of PyQuery is to make jquery queries on xml; I found it to be pretty handy for getting what I wanted pretty quickly. I wanted to pull text so I did:</p>
<blockquote><p>&gt;&gt;&gt; d = pq(myURL)<br />
&gt;&gt;&gt; myText = d(&#8216;div.sampleClass&#8217;) # this&#8217;ll give you every div that you want of sampleClass<br />
&gt;&gt;&gt; myText.eq(&#8216;0&#8242;).text()<br />
&#8216;Hello this is sample text 0!&#8217;<br />
&gt;&gt;&gt; myText.eq(&#8216;1&#8242;).text()<br />
&#8216;Hello this is sample text 1!&#8217;</p></blockquote>
<p>So if you&#8217;re trying to pull text across search pages like I was, and the content is structured enough, you can just loop through every URL, and then loop further through every instance of &#8220;sampleClass&#8221;, saving your text along the way.</p>
<p>I&#8217;m new to this (and new to python in general, to be quite honest), so I found it pretty cool! Please let me know if you have any other tools or PyQuery specific tricks that could be interesting.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.azeem.fm/2010/05/05/pyquery-over-beautifulsoup/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Joining a Startup Out of College is &#8220;Hard&#8221;</title>
		<link>http://www.azeem.fm/2010/04/22/startup-out-of-college/</link>
		<comments>http://www.azeem.fm/2010/04/22/startup-out-of-college/#comments</comments>
		<pubDate>Thu, 22 Apr 2010 14:52:22 +0000</pubDate>
		<dc:creator>Azeem Ansar</dc:creator>
				<category><![CDATA[Startups]]></category>
		<category><![CDATA[college]]></category>
		<category><![CDATA[startup]]></category>
		<category><![CDATA[venture capital]]></category>

		<guid isPermaLink="false">http://www.azeem.fm/?p=12</guid>
		<description><![CDATA[Venture-driven junior recruitment could be a simple solution to an obvious problem of confusion and misdirection, for countless talented undergrads. Especially nowadays, with so many 22 year olds graduating not only jobless, but clueless as well, what a great time for funds to invest in expanding the future human capital of technology.]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.cdixon.org">Chris Dixon</a> posted an article a few months ago <a href="http://cdixon.org/2010/02/11/every-time-an-engineer-joins-google-a-startup-dies/">on the lack of fresh grads joining startups</a>, where he ended with:</p>
<blockquote><p>&#8220;Whenever I see a brilliant kid decide to join Goldman Sachs, McKinsey, or Google, I think to myself: a startup just died, and as a result our world is a little less wealthy, innovative, and interesting.&#8221;</p></blockquote>
<p>I agree that this is an issue. I was not &#8220;brilliant&#8221; enough for a Goldman, McK, or Google; but <a title="Houlihan Lokey" href="http://www.hl.com">I did decide to become an investment banker</a>. My reasons were simple: (perceived) security, exceptional compensation, and &#8220;prestige&#8221;, which is really the same thing as security. When I looked at startups (which I did, as <a href="http://www.linkedin.com/in/azeemansar">I had failed at starting a company in college</a>), I did not see those things, and that scared me awayt. I also did not have much to look at; <a title="University of Pennsylvania" href="http://www.upenn.edu">my alma mater</a> was a great conduit for finance jobs, but finding opportunities at startups was a challenge.</p>
<p>Obviously, I am not the only one who has had this issue; I get emails everyday from undergrads and recent alums in the exact same situation, with the exact same confusion, and without the frothy job market of 2006.</p>
<p>Fortunately, the solution seems to be in slow effect: a friend of mine from Penn recently told me that <a title="Andreessen Horowitz" href="http://a16z.com/">Andreessen Horowtiz</a> was recruiting on campus, not for the fund itself, but for junior positions in their portfolio companies. After hearing that, I thought it was just brilliant; venture funds should be hiring junior talent directly. If I was hired by <a title="Sequoia Capital" href="http://www.sequoiacap.com">Sequoia</a> for a junior BD or PM role at a startup, or even a rotational program between Sequoia&#8217;s portfolio companies, I might have foregone wall street altogether:</p>
<p><strong>Brand Value</strong>: for undergrads, especially at &#8220;higher tier&#8221; schools, brand-name resume builders are craved. Having venture funds recruit and directly hire for startups solves this problem, as in the example above: being hired by Sequoia into some kind of &#8220;startup-development&#8221; program. Turn it into a rotational program, and the brand+experience is more valuable than any investment banking or consulting program (especially for applying to business school, which is an appealing &#8220;hook&#8221; for most).</p>
<p><strong>Compensation</strong>: this needs to be on par with other, more immediately attractive career paths. Foregoing the wall street bonuses is obvious, but comparable salary? Maybe student loan subsidizing? Paying student loans is a common reason for going after the larger compensation packages wall street offers, so getting help here would be another attractive hook.</p>
<p><strong>Security</strong>: the perceived security that comes with &#8220;Sequoia&#8221; is higher to most of the students in question, than that of Sequoia&#8217;s portfolio companies (at least for the most part). This is optics, but optics matter; the rather hefty influx of resumes Andreessen Horowtiz received (or so I heard) attests to this.</p>
<p>Venture-driven junior recruitment could be a simple solution to an obvious problem of confusion and misdirection, for countless talented undergrads. Especially nowadays, with so many 22 year olds graduating not only jobless, but clueless as well, what great timing (and messaging) for funds to invest in expanding future human capital.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.azeem.fm/2010/04/22/startup-out-of-college/feed/</wfw:commentRss>
		<slash:comments>12</slash:comments>
		</item>
		<item>
		<title>Blog going live</title>
		<link>http://www.azeem.fm/2010/04/20/test-1/</link>
		<comments>http://www.azeem.fm/2010/04/20/test-1/#comments</comments>
		<pubDate>Tue, 20 Apr 2010 08:41:25 +0000</pubDate>
		<dc:creator>Azeem Ansar</dc:creator>
				<category><![CDATA[General]]></category>
		<category><![CDATA[Test]]></category>

		<guid isPermaLink="false">http://www.azeem.fm/?p=9</guid>
		<description><![CDATA[I made a new years resolution of publishing more, and I&#8217;ve been wanting to for some time. I think this new platform better suits it; still semi-under construction, check out my tumblr page in the interim.]]></description>
			<content:encoded><![CDATA[<p>I made a new years resolution of publishing more, and I&#8217;ve been wanting to for some time. I think this new platform better suits it; still semi-under construction, check out <a title="Azeem Ansar's tumblr page" href="http://azeemansar.tumblr.com">my tumblr page</a> in the interim.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.azeem.fm/2010/04/20/test-1/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Hello world!</title>
		<link>http://www.azeem.fm/2010/04/19/hello-world/</link>
		<comments>http://www.azeem.fm/2010/04/19/hello-world/#comments</comments>
		<pubDate>Mon, 19 Apr 2010 20:28:21 +0000</pubDate>
		<dc:creator>Azeem Ansar</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.azeem.fm/?p=1</guid>
		<description><![CDATA[Welcome to WordPress. This is your first post. Edit or delete it, then start blogging!]]></description>
			<content:encoded><![CDATA[<p>Welcome to WordPress. This is your first post. Edit or delete it, then start blogging!</p>
]]></content:encoded>
			<wfw:commentRss>http://www.azeem.fm/2010/04/19/hello-world/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
	</channel>
</rss>
