Archive for the ‘general’ Category

0.9.15 – Eye candy for Bar Charts

Sunday, August 29th, 2010

The latest version of SOFA Statistics finally delivers some eye candy in the form of dynamic bar charts. These look good, print cleanly, can be shared easily (anyone with a web browser can view them), and leave plenty of scope for further refinement and customisation.

SOFA charts are dynamic and respond when you move the mouse over them. Unfortunately, the charting library (Dojo) doesn’t work well with WordPress, so a static snapshot has been included in this blog post instead. A live example can be found on the SOFA features page.

Static image of SOFA Bar Chart

SOFA Bar Charts dialog

Version 0.9.15 also handles upgrades properly, e.g. when installed on top of 0.9.14. You can download it from here: http://www.sofastatistics.com/downloads.php

Coming versions of SOFA will add new chart types (probably Pie charts and Clustered Bar Charts first) and the ability to style and otherwise configure them.

0.9.14 Min, max, range, better csv and more

Thursday, August 12th, 2010

Version 0.9.14 has three significant new features:

1) Row stats tables now include min, max, and range.

Min, Max, and Range
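
As a rough illustration of what the three new row statistics report, here is a minimal Python sketch. The function name `row_stats` and the sample ages are made up for this post; they are not taken from SOFA's own code.

```python
def row_stats(values):
    """Return min, max, and range for a list of numbers, skipping None."""
    nums = [v for v in values if v is not None]
    if not nums:
        return None  # nothing to summarise
    lowest, highest = min(nums), max(nums)
    return {"min": lowest, "max": highest, "range": highest - lowest}


# Invented sample data with one missing value.
ages = [23, 35, None, 41, 19]
stats = row_stats(ages)
print(stats)
```

The range is simply the maximum minus the minimum, so it needs no extra pass over the data.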

2) Report tables now visibly separate row variables:

Visible separation

3) CSV importing has been substantially improved. The user gets the option of confirming or changing the auto-identified CSV file delimiter and encoding. Confirming the presence of a header row is also integrated, and the first 20 rows are displayed in a table with visible cell borders.

CSV import options
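
For readers curious how a delimiter and header row can be guessed before asking the user to confirm, Python's standard `csv.Sniffer` does something along these lines. This is only a sketch under that assumption; the post does not say how SOFA itself makes its guess, and the sample data is invented.

```python
import csv
import io
from itertools import islice

# Invented sample standing in for the start of a user's file.
sample = "id;name;age\n1;Ana;34\n2;Ben;29\n3;Cy;41\n"

sniffer = csv.Sniffer()
dialect = sniffer.sniff(sample, delimiters=";,\t|")  # best guess at the delimiter
header_guess = sniffer.has_header(sample)            # heuristic only, worth confirming

# Show the guesses, plus up to 20 rows, so the user can confirm or override.
reader = csv.reader(io.StringIO(sample), delimiter=dialect.delimiter)
preview = list(islice(reader, 20))

print("Delimiter guess:", repr(dialect.delimiter))
print("Header guess:", header_guess)
for row in preview:
    print(row)
```

Because both guesses are heuristics, showing them alongside a preview and letting the user change them is the safer design.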

The full list of changes is:

  • Added min, max, and range to row stats tables.
  • Report tables are much easier to read when there are multiple row variables. Added pale line between row variables in left-most column.
  • CSV importing has been substantially improved. The user gets the option of confirming or changing the auto-identified CSV file delimiter and encoding. Confirming the presence of a header row is also integrated, and the first 20 rows are displayed in a table with visible cell borders. The import is more forgiving of some faults e.g. missing items at the end of a row. These are automatically filled with empty strings. When errors are encountered during CSV import, SOFA tells the user the relevant line number. Also made other minor improvements to (error) messages from CSV importing.
  • Added Filter button next to Table dropdown to make filtering functionality of SOFA more accessible.
  • Much simpler installation for Ubuntu/Linux Mint users. No additional steps required now that SOFA has started using gtk-webkit.
  • ANOVAs and similar group by analyses now cope with massive tables when selecting unique values to show in group by drop downs.
  • Appropriate messages are shown to users when analyses on large underlying tables are about to be run. The user can opt out.
  • Changed default for Anova from precision to speed.
  • Better messages when chopping group by values shown.
  • If SOFA has been opened by an incompatible version of Python it will give a useful message and leave a helpful text file behind with instructions for fixing the problem.
  • Better messages for Chi Square warnings about the number of rows, columns, or cells.
  • Better tool tip for setting MS SQL Server host e.g. my-svr-01, 1433.
  • Added minor improvement to tool tips for data tables. They now always show the labelled value if possible, with read-only status as a suffix where appropriate.
  • Created more robust startup process with better error messages. Users are also able to run test code (e.g. sent by the SOFA developer) and get the result. Great for diagnosis.
  • Overhauled startup code so it checks the version if possible, updates the version, and makes a recovery folder if not current.
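
The more forgiving CSV behaviour in the list above (short rows padded with empty strings, errors reported with a line number) can be sketched in a few lines of Python. The function `read_padded` is hypothetical and only illustrates the idea; it is not SOFA's import code.

```python
import csv
import io


def read_padded(text, delimiter=","):
    """Read CSV text, padding short rows and reporting over-long rows by line."""
    reader = csv.reader(io.StringIO(text), delimiter=delimiter)
    header = next(reader)
    rows = []
    for row in reader:
        if len(row) > len(header):
            # reader.line_num is the line of the source text just read
            raise ValueError(
                f"Line {reader.line_num}: expected {len(header)} items, "
                f"found {len(row)}")
        # Fill missing trailing items with empty strings
        row = row + [""] * (len(header) - len(row))
        rows.append(row)
    return header, rows


header, rows = read_padded("a,b,c\n1,2\n3,4,5\n")
print(header)
print(rows)
```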

There are also some important bug fixes:

  • Fixed bug where only one of multiple databases was shown e.g. for MySQL, PostgreSQL, and MS SQL Server.
  • Fixed bug stopping MS SQL Server 2005 from opening.
  • MS SQL Server 2000 will now open successfully, even if master is not the default database.
  • Fixed bug with Chi Square on MySQL caused by dividing Decimal data type by float.
  • Hourglass starts earlier when saving filter options.
  • Fixed minor bug with the display of tool tips in read-only tables.
  • Now using gtk-webkit inside wxPython. Now displays background images in tables properly in Ubuntu/Linux Mint. Can start working on all output charting.
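
The Chi Square bug in the list above comes down to a general Python behaviour: a `Decimal` cannot be divided by a `float`. The sketch below reproduces the error and shows one possible workaround. The values are invented, and converting to `float` is an assumption about the kind of fix involved, not a description of the actual change in SOFA.

```python
from decimal import Decimal

observed = Decimal("12")  # e.g. a value a database driver hands back as Decimal
expected = 9.5            # a float calculated in Python

mixing_failed = False
try:
    observed / expected
except TypeError as err:
    mixing_failed = True
    print("Mixing types fails:", err)

# One workaround: convert to a single numeric type before dividing.
ratio = float(observed) / expected
print(ratio)
```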

0.9.13 Recode data e.g. age to age group; better stats support

Saturday, July 10th, 2010

SOFA Statistics 0.9.13 has a number of exciting new features.

1) Easy recoding of data e.g. age to age group:

Recoding data e.g. age to age group
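
In SOFA the recoding is done through a dialog, but the underlying idea is easy to show in code. The following Python sketch maps ages to age groups; the band boundaries, the labels, and the catch-all label (loosely modelled on the idea of a "remaining" rule) are all invented for illustration.

```python
# Hypothetical bands: (lowest age, highest age, label)
AGE_GROUPS = [
    (0, 19, "Under 20"),
    (20, 39, "20-39"),
    (40, 64, "40-64"),
]
REMAINING_LABEL = "Other"  # catch-all for anything not matched above


def recode_age(age):
    """Return the age group label for an age, or None if age is missing."""
    if age is None:
        return None
    for lowest, highest, label in AGE_GROUPS:
        if lowest <= age <= highest:
            return label
    return REMAINING_LABEL


ages = [5, 20, 39, 64, 80, None]
groups = [recode_age(age) for age in ages]
print(groups)
```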

2) Better support when choosing statistical tests:

When a selection is made SOFA displays some helpful tips to affirm the choice made or suggest alternatives.

Helpful tips

The dialog also makes it easy to answer questions about your data which will help make a decision.

Answering questions about your data

3) Better support for importing CSV files with different data encodings:

Confirm encoding when importing csv files
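
Why ask the user to confirm the encoding at all? The same bytes can often be decoded by more than one encoding, with very different results. The Python sketch below tries a few candidates on some invented Big5 data. The candidate list and the function name `candidate_decodings` are assumptions for illustration only.

```python
def candidate_decodings(raw, encodings=("utf-8", "big5", "latin-1")):
    """Return {encoding: decoded text} for every encoding that decodes raw."""
    results = {}
    for encoding in encodings:
        try:
            results[encoding] = raw.decode(encoding)
        except UnicodeDecodeError:
            continue  # this encoding cannot read the bytes at all
    return results


# Invented sample: a small CSV header and row saved as Big5.
raw = "姓名,年齡\n張三,34\n".encode("big5")
decoded = candidate_decodings(raw)
for encoding, text in decoded.items():
    print(encoding, repr(text))
```

Note that `latin-1` accepts any byte sequence, so it always "succeeds" even when the result is gibberish. That is exactly why a visual check by the user is valuable.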

4) Nicer Windows installer

New Windows installer

5) More flexible installation options. Any installation folder name containing ‘sofa’ is OK.

There have also been some important bug fixes:

  • Recoding copes with REMAINING keyword properly and copes with varied field types properly.
  • Generally copes better with system encodings like Chinese Traditional (big5).
  • CSV importing handles non-English much better.
  • Corrected handling of non-English characters when errors are encountered.

SOFA Statistics growing in popularity

Wednesday, June 30th, 2010

June 2010 was easily the best month for SOFA Statistics downloads with 2,296 recorded. This is very encouraging for the project and suggests there is demand for a statistics package prioritising ease of use and aesthetics.

Sourceforge downloads for June 2010

The priority in coming releases is the addition of output charting. Some extra work will also go into the Statistical Test Selection Dialog to ensure the correct balance is struck between providing simple advice to beginners and recognising the multitude of factors which may be relevant to making a decision.

As always, users are encouraged to provide feedback on what they like/don’t like about the program. The goal is to make the best application possible so all feedback is welcome.

0.9.12 Integrated online help and more

Monday, June 28th, 2010

SOFA Statistics now comes with integrated online help. Clicking on the “Online Help” button opens the SOFA Statistics User Guide in the user’s default browser.

Integrated Online Help

The goal is to make the user documentation as useful as possible – both to new users and to users trying to solve specialised problems. Over time, lots of gaps will be filled so it will be worth making repeat visits to the site.

The User Guide

Here is the full list of new features:

  • Now able to look at and manipulate your SOFA data using free and open source tools like SQLite Database Browser
  • Improvements for connecting to Google Docs spreadsheets
  • Major improvements to usability around data entry
  • Now able to edit variable details from the table config dialog
  • Much more useful help when errors encountered importing csv files
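
Data that can be opened in a SQLite browser can also be inspected with Python's built-in `sqlite3` module. The sketch below uses an in-memory database and a made-up table (`demo_tbl`) as a stand-in, since the location and table names of a real SOFA data file are not covered in this post.

```python
import sqlite3

# In-memory database with a made-up table, standing in for a real data file.
con = sqlite3.connect(":memory:")
con.execute("CREATE TABLE demo_tbl (id INTEGER PRIMARY KEY, age INTEGER)")
con.executemany("INSERT INTO demo_tbl (age) VALUES (?)", [(23,), (35,), (41,)])

# List the tables, much as a database browser would.
tables = [name for (name,) in con.execute(
    "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name")]
print(tables)

# Peek at the first few rows of one table.
rows = con.execute(
    "SELECT id, age FROM demo_tbl ORDER BY id LIMIT 5").fetchall()
print(rows)
con.close()
```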

Version 0.9.12 also fixes numerous bugs with Google Docs spreadsheets.

0.9.11 imports on-line Google Docs Spreadsheets

Saturday, June 12th, 2010

The latest version, 0.9.11, can import data from your on-line Google Docs Spreadsheets. I will supply more details later, but it is easy to conduct on-line surveys with Google Spreadsheets underneath. And now you can analyse those results using SOFA Statistics.

To connect to your Google account, there is a new button on the standard Import dialog.

New Google Spreadsheet button

This opens a simple dialog letting you sign into your Google account. The existing spreadsheets are listed and then you can select a worksheet. Once this is downloaded, it can be imported in the standard SOFA manner.

Download Google Spreadsheet

Version 0.9.11 also makes it easier to work with read-only columns in data tables.

Read-only Columns

The final improvement relates to filtering. If you have trouble with the flexible data filtering option, more helpful examples of correct syntax are supplied.

The main bug fix is that SQLite and MySQL now handle fields containing double quotes.
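
Quote characters in data are a classic source of database bugs when SQL statements are built by pasting values into strings. A common way to avoid the problem is to pass values as parameters, as in this minimal `sqlite3` sketch. The table and text are invented, and this shows the general technique rather than the specific change made in SOFA.

```python
import sqlite3

con = sqlite3.connect(":memory:")
con.execute("CREATE TABLE comments (id INTEGER PRIMARY KEY, body TEXT)")

text = 'She said "hello" to the group'

# The ? placeholder lets the driver handle quoting, so the double quotes
# in the value cannot break the SQL statement.
con.execute("INSERT INTO comments (body) VALUES (?)", (text,))

stored = con.execute("SELECT body FROM comments").fetchone()[0]
print(stored)
con.close()
```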

Mac package of SOFA Statistics popular

Wednesday, June 9th, 2010

In less than a week the Mac package for SOFA Statistics has been downloaded over 100 times, representing nearly a quarter of all downloads of version 0.9.10.

Example output on OS X

A positive response was expected given recent survey results at the main SOFA Statistics website – 35% of respondents wanted the Mac package as the next thing to add to the project. And about 14% of visitors to the main website are using a Mac OS. But it was still nice to see the response.

0.9.10 has Mac OS X Package

Wednesday, June 2nd, 2010

SOFA Statistics has been packaged for Mac OS X (Leopard and Snow Leopard). It only seems right that an application emphasising ease-of-use and beautiful output should be available for Apple users.

SOFA Statistics on OS X

The installer is supplied as an mpkg (metapackage) file inside a disk image. Open the mpkg to install.

SOFA mac installer

The same application now works in much the same way on Windows (XP, Vista, and 7), Ubuntu/Linux Mint, and Mac OS X (Leopard and Snow Leopard).

SOFA controls on mac

Even if you are not a Mac user, there are a few extras in the new version:

  • MySQL plugin now works with older versions of MySQL e.g. 4.1.
  • MySQL now allows port details e.g. for remote connections.
  • All database plugins allow empty default database or table configuration details (in which case they’ll use the first).
  • Easier to fill in connection details in project. Tooltips (with examples) have been added to database configuration controls to make it clearer what is required from the user.
  • Better placement of dialogs on smaller screens so nothing is off-screen.
  • Problems with connection details raise a useful message for users.
  • SOFA gives a useful message if the SOFA application path is not found because of an unexpected folder name e.g. ‘sofa stats’ rather than ‘sofa’. The Windows installer was also upgraded to give better guidance.
  • Better error messages if anything goes wrong with loading of initial images when starting SOFA.

There are also a few bug fixes:

  • Fixed bug in calculation of total frequency across rows so that missing values are excluded.
  • Background images for tables now available in Ubuntu install.
  • Fixed some minor background bugs.
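
To make the first bug fix above concrete: when missing values are excluded, they should not count towards the total used for row percentages. Here is a small Python sketch with invented survey responses. Treating `None` and the empty string as missing is an assumption made for the example.

```python
from collections import Counter

MISSING = (None, "")  # assumed markers for missing values

responses = ["yes", "no", None, "yes", "", "no", "yes"]

# Count only the non-missing responses.
counts = Counter(value for value in responses if value not in MISSING)
total = sum(counts.values())

percentages = {value: 100 * n / total for value, n in counts.items()}
print(total)
print(percentages)
```

With the two missing responses left out, the total is 5 rather than 7, so "yes" is 60% rather than about 43%.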

You can download version 0.9.10 now: SOFA downloads

NB 0.9.10 (zero point nine point ten) is newer than 0.9.9.
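
The same trap exists in code: compared as plain text, "0.9.10" sorts before "0.9.9". Comparing the parts as numbers gives the right answer, as this short Python sketch shows (the helper name `parse_version` is just for illustration).

```python
def parse_version(text):
    """Turn '0.9.10' into (0, 9, 10) so parts compare as numbers."""
    return tuple(int(part) for part in text.split("."))


newer = parse_version("0.9.10") > parse_version("0.9.9")
text_says_newer = "0.9.10" > "0.9.9"

print(newer)            # numeric comparison of the parts
print(text_says_newer)  # character-by-character comparison
```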

10,000 download milestone reached!

Monday, May 24th, 2010

SOFA Statistics has now reached the 10,000 download milestone on SourceForge, and the rate of downloading has been steadily increasing. In May 2009 there were only 30 downloads, whereas in May 2010 there are likely to be over 50 times that. The main area of work at the moment is creating a Mac package. Follow this blog or the Twitter feed (http://twitter.com/sofastatistics) to stay informed.

10000 downloads!

Increasing downloads of SOFA Statistics

Saturday, May 1st, 2010

SOFA Statistics is growing in popularity, if SourceForge downloads are any indication.

Sourceforge Download Trend

Of course it is early days yet for the project, with version 1.0 not expected until later this year, but current trends are encouraging. It seems there is a gap in the market for an open source statistics, analysis, and reporting package aimed at non-specialist statisticians.