Business Intelligence News

Collaborative Business Intelligence: Integrating BI and KM

A recent survey of senior business executives points to some interesting facts about the state of business intelligence and knowledge management in large corporations.

www.topix.net | 7/29/10 10:18 PM
A Case for Business Intelligence
As organizations grow in scope and complexity, aggregating real-time data from numerous systems and converting that data to decision-ready information becomes increasingly challenging. When striving for Business Performance Improvement (BPI), Business Intelligence technology provides the necessary framework to gain the insight needed to lead to better decisions.

read more

soa.sys-con.com | 7/28/10 6:56 PM
GoodData Secures Another $6.5 million In A Round Led By Fidelity
GoodData is rapidly becoming a key example of the technology innovation emerging from Central Europe - and laying a bet on Europe seems to be paying off for Fidelity Growth Partners Europe , the venture and growth equity investor, which backs European entrepreneurs exclusively. It's invested $2 million in the startup, the second investment for the £100 million fund, leading an overall $6.5 million investment round. GoodData provides an on-demand business intelligence services. Other returning investors include General Catalyst, Andreessen Horowitz and Windcrest Partners. techcrunch.com | 7/28/10 4:37 PM
Social Business Intelligence: The Knowledge Management Connection

Quite often, the most important insights are those that issue from other people's heads, not from our companies' data marts.

www.topix.net | 7/26/10 11:20 PM
Selling the Business Intelligence Implementation
Increasingly more organizations are turning to Business Intelligence professionals to provide historical, current, and predictive views of business operations on their critical information. As a Business Intelligence consultant, your challenge is not usually the sourcing or delivery of the information, but how that information is used going forward once you have left the organization. redir.internet.com | 7/23/10 11:38 PM
SQL Server Integration Services 2008(SSIS) and MySQL
In my previous article I had created linked server, to access and query a database. However, it is just the basic step to try and bring two different RDBMS systems linked together. In this article I will discuss about creating a SSIS package to transfer data from MS SQL Server 2008 to MySQL 5.5 Prerequisite: ___________________________________________________ SQL Server Business Intelligence Development Studio MySQL Connector Net 5.2.7 You can download this package from http://www.mysql.com/downloads/connector/net/ Connector/Net is a fully-managed ADO.NET driver for MySQL. MySQL Connector/ODBC 5.1 (Which is already installed, in our previous article) http://www.mysql.com/downloads/connector/odbc/ Brief about SSIS ___________________________________________________ SSIS provides a graphical front end to design control flow data processing logic.  Once designed, these ‘packages’ are compiled into ‘.dtsx’ packages which can then be distributed and run on any machine with the SSIS tools installed. Packages contain two main logic flows, a ‘Control Flow’ which defines a sequence of logical operations which are processed in sequence. Each step is completed before the next starts e.g. 1.  Empty out work tables in a database 2.  Populate the work tables with data 3.  Perform calculations and update the values in the work table 4.  Update OLAP cubes with the data from the work tables 5.  Run reports against the OLAP cubes. This level of control also allows processing loops to be defined e.g. For each file in a specified folder, read the contents of the file and write it into a specified table. The second main logic flow is the ‘Data Flow’.  This allows for the processing of data at the record level. Data is read from a ‘Data Source’ and passes down a series of ‘Data Transformations’ to a ‘Data Destination’. These transformations can be as simple as changing the data type of fields e.g. varchar(4000) to varchar(2000) or decimal(18,2) to decimal(8,2), or can be more complex like data merges, joins, pivot tables, multicasts etc. Each transformation is represented by an icon in the designer and the icons are linked together to define the logic path. Creating SSIS package to transfer data from MS SQL Server 2008 to MySQL ___________________________________________________ Follow the below steps to create a SSIS package for data transfer. Go to START>MS SQL SERVER 2008>SQL SERVER BUSINESS INTELLIGENCE DEVELOPMENT STUDIO and Click on File>New>Project Under Business Intelligence Projects, select Integration Services Project and rename the Project Name. This will open a Package Designer Screen, where you can start designing the data flow for your package. Configure Connection Managers for you package ___________________________________________________ Right Click in the area where it shows the Connection Manager tab and select New ADO.Net Connection. You will see the Configuration Manager Editor window Click New Now you need to define connection manager configuration for source (i.e. MS SQL Server 2008) in our case. In the Connection Manager Editor, by default the Provider is set to .Net Providers\SqlClient Data Provider Use your test server name or IP for Server Name and under connect to database, select the database to use. I’m using ssistest. Similarly, you need to create connection manager configuration for Destination (i.e. MySQL) Right Click, under Connection Manager’s tab and Select ADO.NET Connection as mentioned above and click New on Connection Manager Editor. For Provider click on the drop down arrow to select .Net Providers\odbc data Provider. In the previous article we had created System DSN name MySQL. We will use the same here for Use user or system data source name. Enter your login information and Test Connection. It should succeed, and then click Ok. Now, on Configure ADO.NET Connection Manager screen, you can see both the source and destination are configured. It’s now time to add control flows to the package, ensure you are on Control Flow tab and Drag Data Flow Task from the Control Flow Items under ToolBox. Now click on the Data Flow tab above and Drag ADO NET Source and ADO NET Destination as shown below. You can rename the Source and Destination Names from Properties. Right click on ADO NET SOURCE i.e. MS SQL Server 2008 and click EDIT. Make the below changes as mentioned on the screen. As mentioned before I have ssistest database which has few sample tables, that I have exported from AdventureWorks database. I’m selecting one of the tables named HumanResources.Department for this example. In the above screen you can see the test server selected with database and table to transfer the data from. Similarly right click on MySQL (Destination) and click Edit. On this screen, under ADO.NET Connection Manager select MySQL.root from the drop downlist. Since we do not have any destination table to map with the source, we need to click on New. It will show you the below message, click OK Now remove the quotes and Click OK to create a table on destination as shown. It will come back to the editor, Click on the Mappings tab and check if all the columns are mapped and click OK. This will complete the data flow design of our package. Saving the SSIS Package ___________________________________________________ Once the package is created, save the copy of it to SSIS Package store or MSDB. Follow the below steps, Click on File> Save Copy of Package.dtsx As from the menu it will open a window, Select SSIS Package Store as the Package Location and Select the name of your test server. Enter /File System/Export2MySQL as the package path and Click OK. Select File > Save Copy of Package.dtsx as again from the menu and Select SQL Server as the Package Location. Select the name of your test server and fill in your authentication information. Enter Export2MySQL as the package path and Click OK. Once the copies are saved you can see them under Integration Services Stored Packages as shown below. Changes on MySQL ___________________________________________________ Before running the Package, make this change on the MySQL Server mysql> SET GLOBAL sql_mode= ‘ANSI’ Executing the Package ___________________________________________________ You can execute the package either from SSMS and BI Development Studio. I’m doing it from the BI, on the right hand side your BI screen you will find solution explorer, which has the Package.dtsx listed. Right click and select Execute Package. The moment you click execute the debugging process start and the Output is show below on the BI screen.  If the Data Flow Task is changed to Green, it means the package has succeeded, and if it is Red it means somewhere in the flow there was an error. On the Data Flow Tab, you can see the number of rows transferred in the execution. There is a progress tab on the screen, where you can check the steps taken in the flow, it also list errors and warnings if any. Query the MySQL Server to see if the data transfer was successful as show below. This was about transferring data from MS SQL Server 2008 to MySQL, you can do vice verse by changing the source and destination while designing the package on Data Flow tab. Hope this article is useful. dbperf.wordpress.com | 7/23/10 1:34 PM
Mining the Contact Center for BI Gems
The use of analytics in contact centers enables management to extract critical business intelligence that would otherwise be lost. By analyzing and categorizing recorded conversations between companies and their customers, useful information can be discovered relating to strategy, product, process, customer satisfaction/retention, and operational issues. www.ecommercetimes.com | 7/23/10 1:00 PM
Jinfonet Announces JReport 10
Jinfonet Software, a provider of Java reporting solutions, on Thursday unveiled JReport 10. This new version adds rich visualization and interactive reporting to a robust, agile BI platform, providing embedded operational reporting to developers and self-service reporting to end users. JReport 10 brings Agile Business Intelligence to the next level with rich visualization features. Web 2.0 self-service reporting allows highly interactive reports to be accessible across the enterprise with superior performance and scalability.

read more

java.sys-con.com | 7/22/10 4:35 PM
Review: 50 Open Source Financial Applications

From balancing your checkbook to running your giant globalcorp, open source has accounting, ERP, business intelligence, billing, human resources, investment, point of sale, spreadsheets, and time-tracking software for everyone; Cynthia Harvey assembles this collection of 50 to get you started.

www.linux.com | 7/21/10 12:36 AM
Instant Access for Salesforce, SharePoint and Google Docs with Loqu8 Insight 2.5

Loqu8 today announced Insight 2.5, the first business intelligence software to speed access for enterprise cloud-based applications.

www.topix.net | 7/17/10 6:54 AM
BI Vendor Qlik Technologies Pops in IPO
While the rest of the stock market licked its wounds following Google's lukewarm earnings report, business intelligence software developer Qlik Technologies bucked the trend and took flight in its first day of trading.


redir.internet.com | 7/16/10 11:53 PM
The Evolution of Testing in the Cloud
It's always exciting to be on the front end of a trend. Some turn out to be short-lived fads while others, such as the electronic distribution of music, disrupt entire industries. There are some who see cloud computing as a fad. Those who share that view tend to look at the Cloud as if it's an all or nothing proposition; if you can't move your entire infrastructure to the Cloud it must certainly be a passing fancy. But the Cloud is here to stay. There are many applications, like compute intensive statistical analysis, business intelligence and testing, that are incredibly well-suited for cloud computing. We're proud to have been first in the Cloud with a commercial testing solution and have been keen observers of the evolution of testing in the Cloud. What have we seen?

read more

soa.sys-con.com | 7/16/10 2:15 PM
Data mart or data warehouse?
This is part two in my six part series on business intelligence, with a focus on OLAP analysis. Part 1 – Intro to OLAP Identifying the differences between a data warehouse and a data mart. (this post) Introduction to MDX and the kind of SQL which a ROLAP tool must generate to answer those queries. Performance challenges with larger databases, and some ways to help performance using aggregation. Using materialized views to automate that aggregation process. Comparing the performance of OLAP with and without aggregation over multiple MySQL storage engines at various data scales. What is a data warehouse? It turns out that this question is a little more difficult to answer than it probably should be. This is because data warehousing has become an overloaded term that includes BI tools (OLAP/data mining), data extraction and transformation tools (ETL), and schema management tools. To me, the definition of data warehouse is “A relational database schema which stores historical data and metadata from an operational system or systems, in such a way as to facilitate the reporting and analysis of the data, aggregated to various levels.” This definition is a consolidation of various definitions that I have encountered. There are a few key points here. First, data warehouses rarely contain information that exists no where else in an organization. The goal of data warehousing is to collect and make a historical record of the information from another system. This might be an ERP application, the logs from a web application, data from manufacturing systems or even data from radio telescopes. This data is extracted from the source system(s) and then cleaned up and inserted into the data warehouse with ETL tools. This process is usually called “conforming” the source data into the warehouse schema. Another important aspect of the definition is aggregation. A data warehouse is usually used to summarize data over years, months, quarters, or other time dimension attributes. This aids in identifying historical trends and making predictions about future trends. Data is often aggregated in many different ways. Aggregated data may be stored in aggregated tables so that it can be accessed quickly. This is particularly important as fact tables reach into the billions of rows and hundreds of gigabytes of information is accumulated. Accessing this data outside of summarized form often takes a very long time. Is there a particular schema design which lends itself to this historical analysis? There are two main methodologies which are practiced when it comes to designing database schemata for database warehouse applications. These two methodologies approach the problem of storing data in very different ways. The first methodology was popularized by Bill Inmon, who is considered by many to be the “father” of the data warehouse, or at least the first dw “evangelist” if you will. This approach focuses on the normalization of data. Highly normalized schema are created and maintained by ETL jobs. Creating and maintaining these jobs is often one of the biggest parts of designing and running a data warehouse. A particular combination of ETL jobs which consist of one or more data transformations is usually called a “flow”. An example ETL flow might combine data from item and category information into a single dimension, while also maintaining the historical information about when each item was in each category. These types of warehouses are almost always “insert only”. Data is very likely never updated or deleted in these databases and they are expected to grow to very large sizes, usually into the terabyte range, but sometimes even into petabytes. Aggregations are the exception to this rule, as they must be updated periodically to reflect the additions of data to the source tables. The goal of this methodology is the 100% accurate description of historical data from the operational system in a normalized manner which ensures that it is able to be updated quickly. It is accepted that analysis of the data will be more complex in this form, but that this complexity is an acceptable trade off for historical accuracy. This is often described as the “top-down” approach. What is a data mart? The second approach, popularized by Ralph Kimball holds that partial de-normalization of the data is beneficial. The goal of a this approach is usually multi-dimensional (OLAP) analysis as it is very hard to create a dimensional model from a highly normalized database schema. It is particularly difficult to build such a model that scales as the volume in the warehouse increases. For this reason OLAP analysis usually is performed on a star schema which partially denormalizes the data. A star schema about a particular subject matter, such as sales, is usually referred to as a “data mart”. Maybe this is because they provide one stop shopping for all the information about the particular subject matter. That is pretty much what I imagine when I hear the phrase. Data marts tend to be updated frequently, at least once per day. As I mentioned in my previous post, a star schema consists of a central table called the fact table and additional dimension tables which contain information about the facts, such as lists of customers or products. Because of the partially denormalized nature of a star schema, the dimension tables in a data mart may be updated. In fact, there is a term for such a dimension – A “slowly changing dimension” or SCD. The fact table is usually only inserted to, but older data may be purged out of it. Sometimes the fact table will be aggregated from source data. A website which sells banner ads might roll up all the events for a particular ad to the day level, instead of storing detailed information about every impression and click for the ad. A normalized data warehouse schema might contain tables called items, categories and item_category. These three tables allow a user to determine which items belong to which categories, but this structure creates a large number of joins when many dimensions are involved. A data mart would collapse all of this information into an item dimension which would include the category information in the same row as the item information. It would be possible create two different dimensions, product and category, but performance tends to decrease as the number of dimensions increases. The difference illustrated In this mock ERD diagram you can three schemata representing sales orders. The star schema is very denormalized, having only four tables which represent the subject. The data warehouse schema, on the other hand, is very normalized and requires tens of tables to represent the same subject. The snowflake schema is a compromise between the two extremes. Typical star, snowflake and data warehouse schemata. Is one better than the other? In the game of data warehousing, a combination of these methods is of course allowed. A company might take the top-down approach where they maintain a large historical data warehouse, but they also build data marts for OLAP analysis from the warehouse data. A different approach is to build a relational warehouse from multiple data marts, or the so-called bottom-up approach to data warehousing. There is also a cousin of the star schema in which the dimensions are normalized. This type of schema is usually called a snowflake schema. The three table item/category/item_category tables in the warehouse schema example would be considered a snowflake. A dimension table (item) must be joined to additional tables (item_category,category) to find the category. These are not as popular as star schemas because they tend to not perform as well as a star schema, particularly as the volume of data in the database increases. So what is the big deal? From a OLAP performance standpoint, many databases will perform better on a star schema than on a snowflake or fully normalized schema at data warehouse volumes. This is in large part because commercial database software supports hash joins, bitmap indexes, table partitioning, parallel query execution, clustered tables and materialized views. These features make working with a star schema much easier than it may be on MySQL, but it is definitely possibly to use MySQL as long as the right tools and techniques are used. There are ways in which can add some of these features to MySQL as well, but that is a topic for a later post. In the next post I’ll talk more about Mondrian and about MDX, the multi-dimensional query language. Mondrian turns MDX into SQL, so we’ll also look at the kinds of queries which are generated by OLAP analysis. Entry posted by Justin Swanhart | No comment Add to: | | | | www.mysqlperformanceblog.com | 7/15/10 5:41 PM
Report: Java and MySQL doing fine under Oracle

A new developer survey report from open-source business intelligence vendor Jaspersoft shows that there has been minimal fallout from Oracle's acquisition of Sun Microsystems, and that Java and MySQL seem to be doing just fine in their new home .

www.topix.net | 7/15/10 3:14 AM
Motorola is Hiring!
ANDROID SMART PHONE Senior DATA WAREHOUSING ENGINEER “We are at the intersection of all things cool in the valley” We are a Sunnyvale, CA based software applications and services development team in the heart of Silicon Valley that is combining the best of the internet, messaging, and social networking into intuitive software, called MOTOBLUR. MOTOBLUR is an innovative cloud-assisted solution to manage and integrate messaging, social networking, and other web services and will debut on the T-Mobile/Motorola CLIQ with MOTOBLUR Android smartphone. This product aggregates contacts, posts, messages, photos and more from sources including Facebook®, Twitter™, MySpace®, Gmail™, and Microsoft Exchange and many more to come. In collaboration with Google’s Android team, we are driving and developing a massively scalable ecosystem. Android is the first free, open source and fully customizable platform built for the mobile internet. Android offers a full stack: an operating system, middleware, and key mobile applications from Google and other. It also contains a rich set up API’s that allows third party developers to develop great applications. We’re financially strong and stable, in the last year generating $300M in positive cash flow; with over $7B in cash and short term investments. Be part of something “innovative & cool” which will reshape the mobile internet and be used by millions of people worldwide. Be part of the resurgent Motorola Mobile Devices business and create applications and services that will enhance the lives of everyone who uses them. Scope of Responsibilities/Expectations: Implement and maintain a high-volume, high-availability hybrid (SQL + NoSQL) data processing infrastructure that consists structured and semi-structured data. Design and implement a large distributed data warehousing and reporting solution and integrate it with 3rd-party business intelligence tools. Basic Qualifications: BS or MS in Mathematics, Statistics or Computer Science Specific Knowledge/Skills: BS or MS in Mathematics, Statistics or Computer Science Expert level Java development experience with emphasis on high performance and scalability Mastery of Hadoop, particularly HDFS, Java MapReduce, Pig, Hive, HBase Strong MySQL experience and strong knowledge of RDBMS fundamentals Working knowledge of ETL, business intelligence and data analytics, preferably Business Objects. Experience working in a Linux environment on a daily basis Hands-on Unix-oriented scripting: Shell, Awk, etc. Successful at establishing strong working relationships with cross-functional teams Strong verbal and written communication skills Working knowledge of Machine Learning methodologies or other regression methodologies like Mahout. Search and Indexing technologies like Lucene/ Solr Other NoSQL experience with “eventual consistency” platforms like Cassandra and Voldemort is a plus APPLY DIRECTLY TO JOB #93304 at: http://www.motorolacareers.com/jobsearch_frames2.cfm?w=search ABSOLUTELY NO AGENCIES OR THIRD PARTY SOLICITATION! everythingmysql.ning.com | 7/14/10 6:39 AM
Open Source BI -- Pentaho and Jaspersoft Part I
Hey DBAs! Are you seeking more efficient ways of shifting through your data to aid your business operations? Two popular Business Intelligence products have community Open Source software are Pentaho and JasperSoft. And both work with MySQL.Both are easy to download and install. Both will use a JDBC connector to connect to MySQL. But how easy are the two to configure and run a simple report against a running instance of MySQL? Setting up a JDBC connection with JasperSoft or Pentaho is pretty much like using any other JDBC connection. The next step is to setup a query like SELECT name, job_title, department FROM employees, departments WHERE employees.emp_id = departments.emp_id. Either package will let you pick a variety of output templates. Then you have the BI software merge your query with the template. I honestly think an average MySQL DBA could fairly quickly generate a nice looking report from their instance and that JasperSoft would be just a little bit faster.In part two of this series, the steps will be more detailed and documented. There will also being comparing and contrasting of the two products. Both products are part of larger projects and there are many useful tools that work with the BI software that you will want to investigate. More on those in later posts.And in a short time you should be able to download a Virtual Box image with both community BI programs and a InfiniDB instance with some data sets. This way you can test all three simply. I would also consider doing a VMWare version if there is demand for it. dave-stokes.blogspot.com | 7/14/10 12:03 AM
Business intelligence poised to help fuel telco growth

BI can help carriers find new revenue-generation opportunities as well as control unproductive expenditures such as superfluous network builds in non-profitable areas.

www.topix.net | 7/13/10 6:10 PM
Intro to OLAP
This is the first of a series of posts about business intelligence tools, particularly OLAP (or online analytical processing) tools using MySQL and other free open source software. OLAP tools are a part of the larger topic of business intelligence, a topic that has not had a lot of coverage on MPB. Because of this, I am going to start out talking about these topics in general, rather than getting right to gritty details of their performance. I plan on covering the following topics: Introduction to OLAP and business intelligence. (this post) Identifying the differences between a data warehouse, and a data mart. Introduction to MDX queries and the kind of SQL which a ROLAP tool must generate to answer those queries. Performance challenges with larger databases, and some ways to help performance using aggregation. Using materialized views to automate that aggregation process. Comparing the performance of OLAP with and without aggregation over multiple MySQL storage engines at various data scales. What is BI? Chances are that you have heard the term business intelligence. Business intelligence (or BI) is a term which encompasses many different tools and methods for analyzing data, usually presenting it in a way that is easily consumed by upper management. This analysis is often used to determine how effectively the business has been at meeting certain performance goals, and to forecast how they will do in the future. To put it another way the tools are designed to provide insight about the business process, hence the name. Probably the most popular BI activity for web sites is click analysis. As far as BI is concerned, this series of posts focuses on OLAP analysis and in a lesser sense, on data warehousing. Data warehouses often provide the information upon which OLAP analysis is performed, but more on this in post #2. OLAP? What is that? OLAP is an acronym which stands for online analytical processing. OLAP analysis, which is really just another name for multidimensional analysis, consists of displaying summary aggregations of the data broken down into different groups. A typical OLAP analysis might show “sale total, by year, by sales rep, by product category”. OLAP analysis is usually used for reporting on current data, looking at historical trends and trying to make predictions about future trends. Multidimensional Analysis Multidimensional analysis is a form of statistical analysis. In multidimensional analysis samples representing a particular measure are compared or broken down into different dimensions. For example, in a sales analysis, the “sale amount” is a measure. Measures are always aggregated values. That is, total sales might be expressed as SUM(sale_amt). This is because the SUM of the individual sales will be grouped along different dimensions, such as by year or by product. I’m getting a little ahead of myself. Before we talk about measures and dimensions, we should talk about the two ways in which this information can be stored. There are two main ways to store multidimensional data for OLAP analysis OLAP servers typically come in two basic flavors. Some servers have specialized data stores which store data in a form which is highly effective for multidimensional analysis. These servers are termed MOLAP and they tend to have exceptional performance due to their specialized data store. Almost all MOLAP solutions pre-compute many (or even all) of the possible answers to multi-dimensional queries. Palo is an example of an open source version of this technology. ESSbase is an example of closed source product. MOLAP servers often feature extensive compression of data which can improve performance. Loading data into a MOLAP server usually takes a very long time because many of the answers in the cube must be calculated. The extra time spent during the load is usually called “processing” time. A relational OLAP (or ROLAP) server uses data stored in an RDBMS. These systems trade the performance of a multidimensional store for the convenience of an RDBMS. These servers almost always query over a database which is structured as a STAR or snowflake type schema. To go back to the sales analysis example above, in a STAR schema the facts about the sales would be stored in the fact table, and the list of customers and products would be stored in separate dimension tables. Some ROLAP servers support the aggregation of data into additional tables, and can use the tables automatically. These servers can approach the performance of MOLAP with the convenience of ROLAP, but there are still challenges with this approach. The biggest challenges are the amount of time that it takes to keep the tables updated and in the complexity of the many scripts or jobs which might be necessary to keep the tables in sync. Part five of my series will introduce materialized views which attempt to address these challenges in a manageable way. What makes a ROLAP so great? An OLAP server usually returns information to the user as a ‘pivot table‘ or ‘pivot report’. While you could create such a report in a spreadsheet, the ROLAP tool is designed to deal with millions or even billions of rows of data, much more than a spreadsheet can usually handle. MOLAP servers usually require that all, or almost all of the data must fit it memory. Another difference is the ease by which this analysis is constructed. You don’t necessarily have to write queries or drag and drop a report together in order to analyze multidimensional data using an OLAP tool. Data before pivoting: Data summarized in pivot form: ROLAP tools use star schema As I said before, a sale amount would be considered a measure, and it would usually be aggregated with SUM. The other information about the sale, such as the product, when it was sold and to whom it was sold would be defined in dimension tables. The fact table contains columns which are joined to the dimension tables, such as product_id and customer_id. These are often defined as foreign keys from the fact table to the dimension tables. A note about degenerate dimensions: Any values in the fact table that don’t join to dimensions are either considered degenerate dimensions or measures. In the example below the status of the order is a degenerate dimension. A degenerate dimension is stored as an ENUM in many cases. In the example below that there is no actual dimension table which includes the two different order statuses. Such a dimension would add an extra join, which is expensive. Any yes/no field and/or fields with a very low cardinality (such as gender or order status) will probably be stored in the fact table instead of in a dedicated dimension. In the “pivot data” example above, all the dimensions are degenerate: gender, region, style, date. Example star schema about sales. Often a dimension will include redundant information to make reporting easier, a process called “denormalization”. Hierarchical information may be stored in a single dimension. For example, a dimension for products may include both the category AND a sub-category. A time dimension includes year, month and quarter. You can create multiple different hierarchies from a single dimension. This allows ‘drill down’ into the dimension. By default the data would be summarized by year, but you can drill down to quarter or month level aggregation. The screenshots here in the jPivot (an OLAP cube browser) documentation can give you a better idea about the display of data. The examples break down sales by product, by category, and by region. The information is presented in such a fashion that it can be “drilled into” and “filtered on” to provide an easy to use interface to the underlying data. Graphical display of the data as pie, line or bar charts is possible. Focusing on ROLAP. This is the MySQL performance blog, and as such an in depth discussion of MOLAP technology is not particularly warranted here. Our discussion will focus on Mondrian. Mondrian is an open source ROLAP server featuring an in-memory OLAP cache. Mondrian is part of the Pentaho open source business intelligence suite. Mondrian is also used by other projects such as Wabit and Jaspersoft. If you are using open source BI then you are probably already using Mondrian. Closed source ROLAP servers include Microstrategy, Microsoft Analysis Services and Oracle BI. Mondrian speaks MDX, olap4j and XML for analysis. This means that there is a very high chance that your existing BI tools (if you have them) will work with it. MDX is a query language that looks similar to SQL but is actually very different. Olap4j is an OLAP interface for java applications. XML for analysis (XMLA) is an industry standard analytical interface originally created by Microsoft, SAS and Hyperion. Whats next? Next we’ll talk about the difference between data marts and data warehouses. The former are usually used for OLAP analysis, but they can be fundamentally related to a warehouse. Entry posted by Justin Swanhart | No comment Add to: | | | | www.mysqlperformanceblog.com | 7/12/10 8:26 PM
Mobile Entr�e 2.0: Mobile Business Intelligence, Dashboards For SharePoint

Mobile Entrée version 2, BI, Dashboards and Branding for your MobileH3 Solutions (newssite) has announced the second major release of their SharePoint mobile application framework, Mobile Entrée, and it’s packed with business intelligence, dashboards and custom theming.

Read full story... www.cmswire.com | 7/12/10 5:16 PM
Oracle Releases BI 11g, Creates Single Business Intelligence Environment

logo-oracle.gif The London launch of Oracle’s (news, site) 11g Business Intelligence Enterprise Edition this week is the culmination of three years work to pull together all its business intelligence tools and build on the technologies it acquired with the Siebel and Hyperion acquisitions.

Read full story... www.cmswire.com | 7/9/10 5:24 PM
Philippines lags behind Asian neighbors in digital TV popularization
The Philippines lags behind other Asian countries and regions in the popularization of digital TV, according to the latest figures released Thursday by Informa Telecoms and Media, a leading provider of business intelligence and strategic services to the global telecoms and media markets. The Philippines ranked second from the bottom of the list in terms of digital TV penetration, with just 5 percent in 2009, up from 1 percent in 2005. By 2015, the country's digital TV penetration will ... english.people.com.cn | 7/8/10 9:26 AM
Oracle releases Business Intelligence 11g
Database giant updates its BI platform with a variety of new services and integration into Oracle's growing array of software products.


redir.internet.com | 7/8/10 12:13 AM
Foursquare Is Five Times Larger Than Gowalla And Growing 75 Percent Faster Every Day
Editor's note: The following analysis is written by Robert J. Moore, the CEO and co-founder of RJMetrics, an on-demand database analytics and business intelligence startup. Robert blogs at The Metric System and can be followed on Twitter at @RJMetrics. Location-based social networks Foursquare and Gowalla are accumulating users (and headlines) with impressive momentum. While both companies have been vocal about reaching major milestones, we wanted to take a closer look at the data behind these accomplishments.  Based on our estimates, Foursquare is not only bigger in terms of both users and venues, but it also is growing at a faster clip than Gowalla. For the past four weeks, we've been monitoring the Foursquare and Gowalla APIs to track growth rates, as well as to sample users and venues. This data was loaded into an RJMetrics Dashboard , which provided the results found here with just a few clicks. We will keep these estimates up-to-date with fresh data and you can view them any time at our Startup Data page. Here are a few highlights from our findings: techcrunch.com | 7/7/10 9:50 PM
Completing Field Force Automation - Extending Intelligent Mobility
I recently wrote an article for The Enterprise Mobility Foundation, The Next Step in Business Process Optimization: Mobility. In this article I describe the enormous amount of work that has gone into designing and automating business processes and ERPs like SAP. The conclusion of the article is that as sophisticated as ERPs are today, there remains significant feature gaps, especially in extending business processes out to the mobile workforce. In this article, I want to highlight some areas where there is still much work to be done in field services. In field services, a number of SAP mobility partners like Clicksoftware and Syclo have comprehensive field services and enterprise asset management solutions with mobile clients for use on smartphones and ruggedized handhelds. However, there are still feature gaps in the integration of real time data, M2M integration, geotags, business intelligence, augmented reality and multi-media support.

read more

soa.sys-con.com | 7/7/10 5:11 PM
EMC Buys Greenplum in Data Warehousing Play
EMC broadens its IT software stack with the purchase of a developer of massively parallel processing database software for business intelligence and analytics.


redir.internet.com | 7/7/10 12:26 PM
EMC Acquires Data Warehousing And Analytics Company Greenplum
Enterprise software giant EMC has acquired data warehousing company Greenplum Software . Financial terms of the deal were not disclosed but the acquisition is an all-cash transaction. Greenplum, which has raised $61 million in funding, develops database software for business intelligence and data warehousing applications. Greenplum has a number of high profile investors, including Sun Microsystems and SAP Ventures. The company's client base includes Skype, Equifax, T-Mobile and Fox Interactive Media Greenplum will become the foundation for a new data computing product division within EMC's Information Infrastructure business. techcrunch.com | 7/6/10 9:36 PM
Microstrategy rolls out BI app for iPhone, iPad
Microstrategy is releasing a new business intelligence app for the iPhone.

www.macworld.com | 7/6/10 6:15 PM
Microstrategy Rolls out BI App for IPhone, IPad (PC World)
PC World - Microstrategy on Tuesday is announcing a next-generation mobile BI (business intelligence) application for Apple's iPhone and iPad, joining the growing pack of enterprise vendors that are embracing the red-hot platform. us.rd.yahoo.com | 7/6/10 2:30 PM
Planet Eclipse: EclipseLive: Upcoming Event: Reminder - New BIRT 2.6 Features in Helios
Event Date: July 8, 2010 1:00 pm GMT-8

Register Now

Jason Weathersby (Actuate)
 
Abstract:

This webinar introduces new features provided by the Business Intelligence and Reporting Tools (BIRT) project with its 2.6 release. BIRT is a powerful reporting framework that is part of the open source Eclipse initiative. Using BIRT, developers can incorporate reports into their applications without the need for time-consuming custom code, or they can build on and extend BIRT to provide valuable reporting services for their applications and products.

This session details improvements in the Helios release, such as the addition of a Polar/Radar style charts, palette hashing, sorting improvements, multiple resource files and class path enhancements. The following topics will be covered:

  • BIRT Overview and Architecture
  • What is BIRT
  • High Level BIRT Architecture
  • What’s new with BIRT 2.6
    • Charting Improvements
    • Resource Files
    • Classpath Enhancements
    • Sorting Improvements
    • SQL Query Builder
    • View Time Variable Evaluation
    • Overridden Library Properties

Total running time will be approximately 30 minutes

6:00 am PDT / 9:00 pm EDT / 1:00 pm GMT / 3:00 CEST - Convert to other time zones


delicious delicious | digg digg | dzone dzone
live.eclipse.org | 7/2/10 4:25 PM
Breaking Down Microsoft BI

I get a lot of questions around what is Microsoft Business Intelligence and I wanted to explain the answer from several perspectives.

www.topix.net | 6/30/10 5:34 AM
451 CAOS Links 2010.06.29
Elephants on parade: Hadoop goes mainstream. And more. Follow 451 CAOS Links live @caostheory on Twitter and Identi.ca “Tracking the open source news wires, so you don’t have to.” Elephants on parade # Cloudera launched v3 of its Distribution for Hadoop and released v1 of Cloudera Enterprise. # Karmasphere released new Professional and Analyst Editions of its Hadoop development and deployment studio. # Talend announced that its Integration Suite now offers native support for Hadoop. # Yahoo announced the beta release of Hadoop with Security and Oozie, Yahoo’s workflow engine for Hadoop. # Datameer announced a strategic partnership with Zementis for predictive analytics on Hadoop. # The Register reported that Twitter is set to open source its MySQL-to-Hadoop tool. # MicroStrategy announced support for Apache Hadoop as a data source for MicroStrategy 9. # Appistry announced Hadoop-based strategic alliances Concurrent, Datameer and Kitenga. # GOTO Metrics released Data Analytics Platform, a Hadoop-based business intelligence platform. Best of the rest # The Software Freedom Law Center responded to the Supreme Court’s decision on Bilski v. Kappos, while Mark Radcliffe provided his thoughts. # David Wiley discussed openness, radicalism, and tolerance (and the lack of it). # Jorg Janke discussed how Compiere overstepped the balance between proprietary and open product components. # Simon Phipps argued that open core is bad for software freedom. # Nick Halsey joined SugarCRM as chief marketing officer. # DotNetNuke more than doubled its subscription customers in 1H10 to nearly 800, expects 400% FY revenue growth. # Nuxeo announced its new Nuxeo Case Management Framework. # Mike Masnick discussed why the lack of billion dollar pure play open source software companies is a good thing. # The Apache Software Foundation announced Apache Tomcat Version 7.0. # Glyn Moody asked whether Oracle has been a disaster for Sun’s open source. # Infoworld discussed eight business strategies for profiting from open source software. # Computerworld reported that Red Hat CEO sees VMware as biggest competitor. # IBM published an essay on the role Linux plays in its smarter planet initiative. # Groklaw asked, What did Microsoft know about SCO’s plan to attack Linux, and when did it know it? # Mozilla won the American Business Award for the most innovative company of the year. blogs.the451group.com | 6/30/10 1:14 AM
IBM Rolls Out BI Offerings for Traders
Software package brings business intelligence and speed to the financial services market to find trading triggers among more than 100 different data feeds.


redir.internet.com | 6/24/10 12:07 PM
Planet Eclipse: Wayne Beaton: Eclipse is? a Tools Platform

My last few blog posts discuss a presentation that I regularly deliver titled “What is Eclipse?“. As previously discussed, I tend to introduce people to Eclipse by starting with what they already know: Eclipse is a movie about vampires that I have no intention of watching. Eclipse is a Java IDE. And, as I’ve stated previously, a darned good one. From there, I try to expand the horizon of the listener by introducing the notion of Eclipse as a IDE platform.

To reinforce the notion of Eclipse as a platform, I then generalize beyond the notion of an IDE and introduce Eclipse as a Tools Platform.

I think it’s pretty common knowledge that the original work on Eclipse was done by particularly smart group at IBM (yes, I’m fawning). What’s not common knowledge is that IBM didn’t initiate work on Eclipse to create yet another Java IDE. Eclipse was created to solve a problem. A big problem.

In the mid- to late-nineties, a lot of very powerful tools became available: we had some pretty fantastic tools for doing Smalltalk work; we had great tools for doing Java development; we had great pretty good tools for doing web development; we had great tools for doing database work; and more (there were even a smattering of modeling tools available). The problem was that all of these tools were different. They had different user interfaces. They interacted with different version control and issue tracking systems. You couldn’t keep all your files for one project in one place. It was–despite all the power–a huge mess.

Eclipse was created to be a powerful integration platform that could bring all the tools into alignment. Just the ability to have all your code, property files, documents, HTML, XML, etc. in once place, managed by a single version control system in a single, consistent manner is immensely powerful. A consistent metaphor for user interaction reduces the training burden, thereby making developers productive across a suite of tools in a short period of time. The Java development tools were, as I’ve heard it, created for two reasons: first, as an example to prove that the Eclipse platform could be used to build a world-class Java IDE; and second, to provide world-class tools for building the Eclipse platform itself.

All of the functionality in Eclipse is delivered as a collection of components, or “plug-ins”. The Java development tools themselves are just a bunch of plug-ins that provide Java development functionality. Those plug-ins can be removed, or augmented with additional plug-ins. I use the term “plug-in” because it’s a term that most people understand. But the truth is that I’ve come to be very uncomfortable with the term: the word “plug-in” implies that there is some monolithic chunk of code that lets you insert a little custom behaviour using a highly-restrictive API. But that’s not the case: all of the functionality in Eclipse is a component (even the base workbench, and the component management system itself are components). All components, including those that come with Eclipse and ones that are added by you or third-parties, have access to the same APIs. It’s like a software utopia (think Tron without the MCP).

So Eclipse has this wonderful modular non-architecture that lets you easily augment behaviour by adding new components into the mix. Today there are literally hundreds–if not thousands–of first-class tools that are part of the loosely-coupled-yet-tightly-integrated wonderfulness of Eclipse. The Business Intelligence and Reporting Tools (BIRT) project provides tools that business analysts (i.e. non-programmer-types) use to build world-class reports with feature-rich charts and more. The Data Tools project provides tools for database administrators. The Web Tools project provides a wealth of tools for web designers, developers, deployers, and testers. The Test and Performance Tools Platform (TPTP) project provides tools for–obviously–test and performance work. Of course, I can’t leave out modeling. Modeling at Eclipse, like Eclipse itself, is relatively hard to define. The Modeling project provides tools, but it also produces runtime frameworks (and how do you categorize metamodels?) It’s probably easiest to, again, start with something people understand: modeling tools, like UML2 Tools, and–of course–The Eclipse Modeling Framework with tools to create, compare, validate, query, and manipulate models.

In short, it’s not all about developers.

The Eclipse Marketplace provides a door to a world of tools that make Eclipse do things that the original designers could not possibly imagined. But there’s more to it. Eclipse is… an Application Framework.

dev.eclipse.org | 6/22/10 8:28 PM
Salesforce Chatter Aids Collaboration in the Enterprise
Salesforce.com is encouraging enterprises to get chatty with the release of its Chatter cloud-based enterprise social-collaboration platform. Analysts said this could signal an impending wave of similar solutions in industries beyond Customer Relationship Management (CRM) because Chatter creates opportunities for developers to build and resell apps that include the platform.

Chatter leverages familiar social-networking features from Facebook and Twitter, including profiles, status updates, and real-time feeds. The platform lets enterprises collaborate around documents, but it also lets employees "follow" people, business processes, and application data. The goal is to boost business intelligence by driving real-time insights into programs, projects, people, customers, cases, documents and business data.

"Salesforce Chatter is nothing short of revolutionary for the enterprise," said Denis Pombriant, founder and managing principal of Beagle Research Group. "Salesforce Chatter creates an entirely new way to work for employees by opening up lines of communication and ad hoc coordination across every level of an organization. Enterprises can now realize new highs in productivity by harnessing the power of social collaboration."

What is Cloud 2?

Salesforce Chatter is a Cloud 2 application. Much like Web 2.0, Cloud 2 is the next generation of enterprise cloud computing, which is social, mobile and real time. Familiar Cloud 2 applications include Facebook, Google and Twitter. But Chatter is the first Cloud 2 collaboration application for the enterprise.

Farmers Insurance is one of about 100 beta testers for Chatter. The insurer has multiple stakeholders working on different aspects of its marketing strategies -- and Chatter fits the solutions bill, said Mitch Varhula, marketing consultant at Farmers.

"With Salesforce Chatter, the advertising department has increased collaboration across marketing initiatives and improved service levels to our internal customers of more than 14,000 agents," Varhula said. "What's been most amazing about Chatter is how it leverages a secure platform with a trusted sharing model to...

www.cio-today.com | 6/22/10 4:08 PM
EarthSearch to Unveil LogiBoxx at 8th Annual 3PL Summit in Atlanta

OTCBB : ECDC ), announced today that it will perform the first industry demonstration of its LogiBoxx device at the 8th Annual 3PL Summit sponsored by eyefortransport, a part of FC Business Intelligence.

www.topix.net | 6/22/10 1:32 PM
Planet Eclipse: EclipseLive: Upcoming Event: New BIRT 2.6 Features in Helios
Event Date: July 8, 2010 1:00 pm GMT-8

Register Now

Jason Weathersby (Actuate)
 
Abstract:

This webinar introduces new features provided by the Business Intelligence and Reporting Tools (BIRT) project with its 2.6 release. BIRT is a powerful reporting framework that is part of the open source Eclipse initiative. Using BIRT, developers can incorporate reports into their applications without the need for time-consuming custom code, or they can build on and extend BIRT to provide valuable reporting services for their applications and products.

This session details improvements in the Helios release, such as the addition of a Polar/Radar style charts, palette hashing, sorting improvements, multiple resource files and class path enhancements. The following topics will be covered:

  • BIRT Overview and Architecture
  • What is BIRT
  • High Level BIRT Architecture
  • What’s new with BIRT 2.6
    • Charting Improvements
    • Resource Files
    • Classpath Enhancements
    • Sorting Improvements
    • SQL Query Builder
    • View Time Variable Evaluation
    • Overridden Library Properties

Total running time will be approximately 1 hour

9:00 am PDT / 12:00 pm EDT / 4:00 pm GMT - Convert to other time zones


delicious delicious | digg digg | dzone dzone
live.eclipse.org | 6/21/10 3:30 PM
Apex starts hyperion online training from its india's development center

Hyperion is a Business Intelligence and Business Performance Management Tool. Its the market leader in Financial, Operational and Strategic Planning.hyperion training

www.topix.net | 6/19/10 10:29 PM
AvePoint Supports Microsoft SharePoint Server 2010

Microsoft SharePoint, a software platform for collaboration and Web publishing includes capabilities such as content management system, business intelligence, search, wikis, blogs and application development.

www.topix.net | 6/19/10 9:04 AM
One reason for BI failure

What can explain business intelligence's poor adoption rate? Are tools not easy to use? Or is there a deeper reason? A book from 2000, The Social Life of Information by John Seely Brown and Paul Duguid, suggests that BI designers have neglected basic human needs.

www.topix.net | 6/17/10 2:34 AM
Crowdcast Brings Fun and Games to Business Intelligence

Crowdcast Brings Gaming to Business Intelligence Size matters—when it comes to business intelligence. The larger the company, the more complicated it can be to harvest the right knowledge from the right people. Crowdsourcing is the obvious answer here, but if you’re bored with the usual approach, a solution called Crowdcast aims to shake things up with a gambling-flavored twist. 

Read full story... www.cmswire.com | 6/14/10 3:30 PM
Pentaho Open Source BI Goes On-Demand
Pentaho has taken its open source business intelligence (BI) widgetry and, through the wonders of VMware virtualization, made it into an on-demand subscription service that it fancies will remove critical barriers to BI adoption by giving customers control over how the solution is deployed and managed – by Pentaho, by the customer, or by both. See, it’s solved the portability issue because the customer can move the image in-house. It’s also turned the venture into a marketing gimmick called the 72-Hour Challenge. For a couple thousand bucks, window shoppers can send Pentaho their data and it promises to have an evaluation project up and running with key performance indicators and relevant dashboards in three days. The potential customer can then show the prototype around in a live webinar and test-drive it for three weeks, even expand it, without commandeering any of its own hardware or personnel.

read more

opensource.sys-con.com | 6/11/10 3:00 PM
Pentaho Open Source BI Goes On-Demand
Pentaho has taken its open source business intelligence (BI) widgetry and, through the wonders of VMware virtualization, made it into an on-demand subscription service that it fancies will remove critical barriers to BI adoption by giving customers control over how the solution is deployed and managed – by Pentaho, by the customer, or by both. See, it’s solved the portability issue because the customer can move the image in-house. It’s also turned the venture into a marketing gimmick called the 72-Hour Challenge. For a couple thousand bucks, window shoppers can send Pentaho their data and it promises to have an evaluation project up and running with key performance indicators and relevant dashboards in three days. The potential customer can then show the prototype around in a live webinar and test-drive it for three weeks, even expand it, without commandeering any of its own hardware or personnel.

read more

dotnet.sys-con.com | 6/10/10 9:00 PM
Crowdcast Raises $6 Million To Make Business Intelligence More Social
Crowdcast , a provider of social business intelligence (SBI) solutions, has raised $6 million in Series A funding. The round was led by Menlo Ventures , with participation from Alsop-Louie Partners . The company's name is a combination of the words 'crowd' and 'forecast', and its self-declared goal is to help companies find out what their people really know about their businesses, in order to make better strategic leadership decisions. techcrunch.com | 6/10/10 1:01 PM
Aptech Celebrates 40th Anniversary - Propelling Hospitality from Mainframes to Cloud Computing

'We Have Your Back and Do What It Takes to Make You Successful:' Jay Troutman, president of Aptech Computer Systems 'Jay Troutman accepting Aptech's independent national sales distributorship for WANG computers in 1984' Aptech Computer Systems, Inc ., the leading provider of hospitality software for business intelligence and enterprise financial ...

www.topix.net | 6/10/10 9:45 AM
Self-Service, Geospatial Business Intelligence Revealed at the Microsoft BI Conference

IDV Solutions today announced their participation at Microsoft's Business Intelligence Conference in New Orleans, LA.

www.topix.net | 6/10/10 5:40 AM
Apple Debuts iPhone 4; Sap Bi Image Gallery

Image Gallery: SAP Unveils Integrated BI Strategy Roadmap Laying out an ambitious roadmap at its SAPPHIRE NOW conference in May, SAP executives detailed an in-memory, tech-powered plan to unite enterprise applications and business intelligence.

www.topix.net | 6/8/10 5:15 PM
Mashboards: Better, Faster Decisions at the Web-BI Intersection
We've recently seen a resurgence of interest and coverage in predictive analytics. If making better business decisions is the primary goal, most organizations -- large and small -- could benefit from a modern, simple, mainstream business intelligence tool: the mashboard. www.crmbuyer.com | 6/7/10 1:00 PM
Major Oracle BI Launch Coming Next Month (PC World)
PC World - Oracle is planning to release one of the most significant updates to its BI (business intelligence) platform in years at a July 7 event in London, according to the company's Web site. us.rd.yahoo.com | 6/3/10 6:40 PM