Blog

Syndicate content

Statistics in the Cloud With R on OpenShift

Statistics

R is an open-source statistical software. It derives from S a closed-source statistical system. R provides an environment to run and evaluate statistical computation and it is also used for data-mining. I am personally starting with R and I am exploring the possibilities. But before we dive in, let's see what the official page says

R is a language and environment for statistical computing and graphics. It is a GNU project which is similar to the S language and environment which was developed at Bell Laboratories (formerly AT&T, now Lucent Technologies) by John Chambers and colleagues. R can be considered as a different implementation of S.

OpenShift Origin Recognized by Github Octoverse for Top Community Contributions

GitHub State of the OctoVerse 2013 One of the wonderful things about working in the open - as we have been for the past 2 years on the OpenShift Origin project - is that transparency keeps you honest, and sometimes you are rewarded with recognition for your efforts.

In Github's recent "State of the OctoVerse" 2013 recap, OpenShift Origin was recognized as one of the movers and shakers on Github. OpenShift's origin-server came in at #5 in the Community Highlights category of "Most Merged Pull Requests".

While some may try to disregard this metric as a side-effect of our automated test and build processes. In reality, this is a clear sign of Red Hat's commitment to working in an open and transparent manner.

Customizing Autoscale Functionality in OpenShift

OpenShift's autoscaling functionality is one of the most asked about features of OpenShift. The autoscaler is a self contained daemon (called haproxy_ctld) that runs inside your gears and it watches HAProxy (the load balancer OpenShift uses to balance across several gears). As the number of people requesting access to your site goes up, haproxy_ctld sends a request to the broker to add more gears. As the number of concurrent users on your app goes down, haproxy_ctld will tell the broker to start destroying gears.

How it Works Today

This actually works really well as a default for auto-scaling behavior. To look at some scaling and queue theory, consider this... If you have just one user using your app, it would be extremely rare that adding more gears would cause better performance. Scaling out doesn't really make things 'faster'. Scaling out, by adding more virtual machines, gears, or whatever, has one specific goal in mind.

12 Steps For Teaching Your Next Programming Class on OpenShift

computer education class using OpenShift picture

The OpenShift Platform as a Service (PaaS) is a valuable resource for running tutorials on web programming, especially if you have a limited budget.

OpenShift abstracts away configuration headaches to help students create shareable applications quickly and easily, for free, using extensible open-source code – as I explained in a previous post.

In this blog post, I will draw on my personal workshop experiences to outline 12 steps for teaching your next programming class with OpenShift Online.

1.

Free Magnolia CMS Hosting on OpenShift

Recently I gave a talk on deploying the Magnolia CMS on OpenShift at the Magnolia CMS annual conference in Basel, Switzerland. The talk was well attended and some fellow developers asked me if I could document the process. In this step-by-step blog post, I will show you how you can have a Magnolia CMS instance up and running on OpenShift within minutes. OpenShift is Red Hat's open source, polyglot,and scalable Platform as a Service.

Prerequisites

Before we can start building the application, we'll have to do a few set-up tasks:

  1. Basic Magnolia CMS knowledge is required. You can refer to the Magnolia CMS documentation for more information.

  2. Basic Git knowledge is required. Git is a distributed revision-control and source code management system.

Build your own Google Maps (and more) with GeoServer on OpenShift

Greetings Shifters! Today we are going to continue in our spatial series and bring up Geoserver on OpenShift and connect it to our PostGIS database. By the end of the post you will have your own map tile server OR KML (to show on Google Earth) or remote GIS server.

The team at Geoserver has put together a nice short explanation of the geoserver and then a really detailed list. If you want commercial support, Boundless will give you a commercial release and/or support for all your corporate needs. Today though I am only going to focus on the FOSS bits.

Getting started

There are two ways to run Geoserver. They ship a version that includes a Jetty container so you can just unzip and run on your local machine.

Red Hat and Hortonworks: Platform for Industrial Internet and Big Data

Today, Red Hat and Hortonworks announced a deep partnership between both companies offering a platform for modern enterprises. Their joint solution offers a platform to help organizations take advantage of big data and open source cloud technologies. This is an important announcement bringing closer collaboration between two companies driven by true open source philosophy. This marriage also brings tighter integration to Red Hat's various enterprise cloud offerings and Hortonworks' Hadoop platform. In this post, I will explain why I think this is an important announcement.

Modern Enterprise and Big Data

The face of enterprise is changing faster than anytime in the past, mainly driven by cloud computing, social, mobile and big data. From consumer web companies to construction companies to health care companies to financial companies, big data processing is becoming more and more prevalent.