Category Archives: Big Data

 No More Google

 No More Google

Privacy-friendly alternatives to Google that don’t track you

https://nomoregoogle.com

Google Chrome
6642 pts Firefox
2044 pts Vivaldi
1969 pts Safari
1926 pts Brave
1125 pts Opera
Google Search
4952 pts Duck Duck Go
1053 pts Qwant
946 pts Startpage
611 pts SearX
Google Chrome Passwords
1963 pts 1Password
1925 pts KeePass
1788 pts LastPass
1188 pts BitWarden
543 pts Dashlane
199 pts Enpass
Google Analytics
797 pts Matomo
407 pts Amplitude
207 pts Fathom
Google Mail
2947 pts FastMail
2461 pts ProtonMail
542 pts Zoho Mail
408 pts Tutanota
398 pts GMX
354 pts Canary Mail
Google Docs
1223 pts Notion
546 pts Bear Writer
437 pts Workflowy
431 pts Zoho
188 pts Quip
66 pts Zoho Docs
Google Sheets
905 pts Airtable
431 pts Zoho
404 pts Coda
188 pts Quip
YouTube
1501 pts Vimeo
804 pts PeerTube
539 pts Dailymotion
83 pts DTube
Google Maps for Websites
613 pts Leaflet
563 pts Mapbox
306 pts Map 🐈
291 pts Nextzen
Google AdWords
148 pts BuySellAds
Google Authenticator
1048 pts Authy
851 pts 1Password
689 pts AndOTP
416 pts Duo
Google Blogger
1323 pts WordPress
1103 pts Medium
656 pts Ghost
396 pts Squarespace
Google DNS
1757 pts Cloudflare
793 pts OpenDNS
476 pts Quad9
Google Drive
1668 pts Dropbox
1118 pts Nextcloud
948 pts MEGA
431 pts Zoho
385 pts Arq Backup
246 pts Backblaze
69 pts SpiderOak
Google Finance
404 pts Bravos
Google Flights
940 pts Skyscanner
340 pts Skiplagged
217 pts Kiwi
65 pts Hipmunk
Google Hangouts
2191 pts Telegram
1806 pts Signal
502 pts Discord
472 pts Slack
437 pts Appear.in
308 pts Wire
73 pts Zoom
15 pts Stealthy
Google Images
1019 pts Unsplash
757 pts Pexels
454 pts 500px
424 pts TinEye
Google Maps
1413 pts OpenStreetMap
1008 pts Here
Google Poly
32 pts Sketchfab
6 pts TurboSquid
Google Scholar
272 pts arXiv
Google Translate
565 pts WordReference
458 pts Translate.com
415 pts Deepl
Google Weather
893 pts Dark Sky

bigdata – Global Science Research (GSR)

About us

Global Science Research (GSR) was founded to optimize marketing strategies with the power of big data and psychological sciences. Our innovative methods produce insight on a revolutionary scale, empowering clients to understand consumers, markets, and competitors more deeply and accurately than ever before. GSR’s team is formed of experienced business analysts and leading academics, all of whom are experienced in helping companies utilize the power of psychology in the customer journey. Our innovative methods produce consumer insight on a revolutionary scale, empowering our clients to understand their consumers, markets, and competitors more deeply and accurately than ever before.

 https://www.linkedin.com/company/global-science-research/

companieshouse.gov.uk

The Infrastructure Behind Twitter: Scale

 

  • Hadoop: We have multiple clusters storing over 500 PB divided in four groups (real time, processing, data warehouse and cold storage). Our biggest cluster is over 10k nodes. We run 150k applications and launch 130M containers per day.

  • Manhattan(the backend for Tweets, Direct Messages, Twitter accounts, and more): We run several clusters for different use cases such as large multi tenant, smaller for non common, read only, and read/write for heavy write/heavy read traffic patterns. The read/only cluster handles 10s of millions QPS whereas a read/write cluster handles millions of QPS. The highest performance cluster, our observability cluster, which ingests in every datacenter, handles over tens of million writes.

  • Graph: Our legacy Gizzard/MySQL based sharded cluster for storing our graphs. Flock, our social graph, can manage peaks over tens of million QPS, averaging our MySQL servers to 30k – 45k QPS.

  • Blobstore: Our image, video and large file store where we store hundreds of billions objects.

  • Cache: Our Redis and Memcache clusters: caching our users, timelines, tweets and more.

  • SQL: This includes MySQL, PostgreSQL and Vertica. MySQL/PosgreSQL are used where we need strong consistency, managing ads campaign, ads exchange as well as internal tools. Vertica is a column store often used as a backend for Tableau supporting sales and user organisations.

 

https://blog.twitter.com/engineering/en_us/topics/infrastructure/2017/the-infrastructure-behind-twitter-scale.html

The Infrastructure Behind Twitter: Scale

Overview of Twitter Fleet

Twitter came of age when hardware from physical enterprise vendors ruled the data center. Since then we’ve continually engineered and refreshed our fleet to take advantage of the latest open standards in technology and hardware efficiency in order to deliver the best possible experience.

https://blog.twitter.com/engineering/en_us/topics/infrastructure/2017/the-infrastructure-behind-twitter-scale.html

dashboard for VMware, SNMP, REST API and more

Simple dashboard system for sysadmins with modules for VMware, SNMP, REST API and more

SysAdminBoard is a simple dashboard system written in Python, HTML and Javascript and served on a simple CherryPy Webserver (included). It was originally written to reformat snmp data for the Panic Statusboard iPad App, but has since become a fully stand-alone project that can grab data from a variety of sources and render charts and graphs in a web browser.

NSA data collection about ‘population control’ not law enforcement – whistleblower

FBI whistleblower Jesselyn Radack joins RT America’s Simone Del Rosario to discuss thye growing concern that the NSA is collecting so much data that it can no longer be effective in preventing terror. Radack says the terror attacks of 9/11 created a ‘blank check’ wherein the usual constraints on surveillance were removed, including probable cause and the necessity of getting a warrant before conducting domestic data collection.