Category Archives: Sh*ts

Retrieve all links from a html page/file with PHP

 

$html = file_get_contents('path/to/file.html');
// Create a new DOM document
$dom = new DOMDocument;

// Parse the HTML. Use the @ to avoid any parsing errors if the $html string isn't valid XHTML
@$dom->loadHTML($html);

// Get all links 'a' we can use also 'img', 'li, 'table', etc to extract other tags.
$links = $dom->getElementsByTagName('a');

foreach ($links as $link){
    echo $link->nodeValue;
    // Full node
    echo $link->getAttribute('href'), '<br />';
    // the link itself
}

Google Web Search API limitation

 

Free quota

Usage is free for all users, up to 100 queries per day.

Paid Usage

Any usage beyond the free usage quota will fail if you are not signed up for billing. Once you have enabled billing, you will continue to receive 100 free queries per day. However, you will be billed for all additional requests at the rate of $5 per 1000 queries, for up to 10,000 queries per day. If you need additional quota, please request additional quota from the console.

OpenStack Havana Demo

Mark Collier, COO of the OpenStack Foundation, shows off some of the new services enabled by OpenStack Havana such as Metering and Orchestration.

OpenStack Havana

OpenStack Havana, the eighth release of the open source software for building public, private, and hybrid clouds, has nearly 400 new features to support software development, managing data and application infrastructure at scale. The OpenStack community continues to attract the best developers and experts in their disciplines with 910 individuals employed by 145 different organizations contributing to the Havana release.

Learn all about Havana at the Hong Kong OpenStack Summit

 

16 Free Online Business Courses That Are Actually Worth Your Time

Yale: Financial Markets with Bob Shiller

Platform: Coursera

Stanford: Entrepreneurship Through The Lens Of Venture Capital

Platform: Stanford/ITunesU

Stanford: Game Theory

Platform: Coursera

The University of Pennsylvania/Wharton: Gamification

Platform: Coursera

Columbia: Financial Engineering And Risk Management (Part 1)

Platform: Coursera

Duke: How To Reason And Argue

Platform: Coursera

Duke: A Beginner’s Guide To Irrational Behavior

Platform: Coursera

The University of Pennsylvania/Wharton: An Introduction To Marketing

Platform: Coursera

The University of Pennsylvania/Wharton: An Introduction To Financial Accounting

Platform: Coursera

The University of Pennsylvania/Wharton: An Introduction To Operations Management

Platform: Coursera

The University of Pennsylvania/Wharton: An Introduction To Corporate Finance

Platform: Coursera

Udacity: How To Build A Startup

Platform: Udacity

MIT/UC Berkeley: Introductory Macro And Microeconomics

Platform: MIT OpenCourseWare/UC Berkeley on ITunesU

MIT: Innovation And Commercialization

Platform: EdX

Columbia: Economics Of Money And Banking

Platform: Coursera

MIT: Introduction To Lean Six Sigma Methods

Platform: MIT OpenCourseWare

BONUS: Ben Bernanke on the Federal Reserve and MIT’s Andrew Lo on Financial Theory

Read more: http://www.businessinsider.com/best-free-online-business-courses-2013-10?op=1#ixzz2hrpOdEyw

 

 

TwitterOAuth! Timelines and streams

PHP library for working with Twitter’s OAuth API.

https://github.com/abraham/twitteroauth

Authenticating a Twitter Feed for OAuth API V1.1 – Timelines & streams

This tutorial is designed to help anyone who is using or wants to create a custom Twitter feed that needs to authenticate basic read-only access for any public user timeline with Twitter OAuth, API V1.1.
http://www.webdevdoor.com/php/authenticating-twitter-feed-timeline-oauth/

Video presentation of Hadoop, by Doug Cutting

A nice video presentation of what Hadoop is, by Doug Cutting, the father of Hadoop.

http://www.cloudera.com/content/cloudera/en/resources/library/aboutcloudera/beyond-batch-the-evolution-of-the-hadoop-ecosystem-doug-cutting-video.html

Description

Apache Hadoop started as batch: simple, powerful, efficient, scalable, and a shared platform. However, Hadoop is more than that. It’s true strengths are:

  • Scalability – it’s affordable due to it being open-source and its use of commodity hardware for reliable distribution.

  • Schema on read – you can afford to save everything in raw form.

  • Data is better than algorithms – More data and a simple algorithm can be much more meaningful than less data and a complex algorithm.

 

Hadoop on Amazon and AirBnB

 

First, it uses Hadoop to support its associates program, in which affiliates post links to Amazon-based products on their websites and get a percentage of related revenue. Originally, Rauser explained, Amazon developers wrote three separate applications in C++ to process and analyze data associated with these transactions to determine how much to pay each affiliate. But the system quickly began to run up against scaling issues, particularly every fourth quarter (typically the busiest quarter of the year for Amazon.) Continue reading Hadoop on Amazon and AirBnB

AjaXplorer aka pydio another dropbox ‘clone’

Pydio allows you to instantly turn any server into a powerful file sharing platform. Formerly known as AjaXplorer (550 000 downloads in Sept.2013), we are now rebranding and this website is dedicated to the community support.

Gain back control and privacy on your data, reduce your costs, and empower your users with Pydio consumer-grade simplicity allied to enterprise-grade compliance. Pydio deploys on top of your legacy filesystems and instantly connects to your existings users directories. Its plugin-oriented architecture makes it ready for all situations.

Pydio (aka. ajaXplorer) is a powerful, open source, web-based file sync tool.
Jack Wallen, Tech Republic

Red Hat Storage + Pydio 5: scale out storage for the enterprise with Dropbox accessibility.
Veda Shankar, Red Hat Storage