Archive for the ‘general’ Category

More chart improvements in 1.1.5

Sunday, May 27th, 2012

Charts now have the option of rotated (vertical) x-axis labels. This can be useful for longer labels.

Rotated labels

Note – if you have upgraded SOFA, rotated labels may not work unless you replace the sofastats_charts.js file in your local sofastats folder (e.g. C:\Users\username\sofastats\reports\sofastats_report_extras) with the sofastats_charts file for sofastats_report_extras.

Scatterplots now focus on the data better by starting axes just below the minimum x and y values of the data unless the value is close enough to 0 to make it worth using 0 anyway.

Scatterplot focus
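
The axis rule described above can be sketched roughly as follows. This is only an illustration: the function name `axis_start` and the `zero_threshold` and `padding` values are assumptions made for the example, not the values SOFA actually uses.

```python
def axis_start(min_value, max_value, zero_threshold=0.25, padding=0.05):
    """Pick a lower axis bound for data spanning min_value..max_value.

    zero_threshold and padding are hypothetical tuning values chosen
    for this sketch; they are not taken from SOFA.
    """
    # If the minimum is non-negative and small relative to the maximum,
    # starting the axis at 0 wastes little space, so use 0.
    if 0 <= min_value <= zero_threshold * max_value:
        return 0.0
    # Otherwise start just below the minimum, leaving a small margin.
    span = (max_value - min_value) or abs(min_value) or 1.0
    return min_value - padding * span


print(axis_start(2, 100))   # minimum is close to 0, so the axis starts at 0
print(axis_start(80, 100))  # minimum is far from 0, so the axis starts just below 80
```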

And for Ubuntu users, a much nicer launcher icon :-). Actually, it’s a set of icons at different resolutions so that SOFA always looks good on the launcher.

More attractive launcher icon in Ubuntu

Other changes include:

  • Numeric values are right justified in data tables.
  • Kurtosis values in the normality test include the Fisher adjustment (subtracting 3).
  • Duplicated field names in imports are given unique suffixes and allowed (now that they are unique).
  • Excel importing now handles times without dates.
  • More date formats are accepted when importing data.
  • Better guidance on data preparation before importing data.
  • More robust handling of variable definition files if corrupted.
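
To make the kurtosis adjustment mentioned in the list above concrete, here is a minimal sketch. It uses the simple moment-based (population) estimator, which is an assumption for the example; SOFA's own calculation may apply further sample-size corrections. The function name `excess_kurtosis` is hypothetical.

```python
def excess_kurtosis(values):
    """Moment-based kurtosis with the Fisher adjustment (minus 3).

    A normal distribution has kurtosis 3, so subtracting 3 makes
    "normal" correspond to 0.
    """
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n  # second central moment
    m4 = sum((v - mean) ** 4 for v in values) / n  # fourth central moment
    return m4 / m2 ** 2 - 3.0


print(excess_kurtosis([1, 2, 3, 4, 5]))  # negative: flatter than a normal curve
```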

Note: if upgrading on Linux, the two user folders (sofastats and sofastats_recovery) may be shifted from inside your home folder to a better location e.g. “/home/username/Documents” if free desktop standards are supported. After upgrading you may wish to manually replace the contents of the new folders with the contents of the old ones.

Bug Fixes

  • Fixed small bug stopping column labels displaying in data table view.
  • Fixed bug in recode operation which would wipe the table if any errors at all were encountered trying to turn the user recode config into SQLite update clauses.
  • Fixed bug in getting structured data e.g. for line charts, where a user names a field freq and thus has a conflict with my own freq field. Renamed the internal field to _sofa_freq to prevent collisions.
  • Creating user’s default proj file now copes with apostrophes etc in user path e.g. /Users/Jim’s/etc.
  • The project dialog now displays the default report and css details saved with it from previous occasions.
  • Project settings are only applied if the project is selected – they are not automatically triggered by changes when configuring a project.
  • Multi-line values entered into data cells e.g. variable label settings, automatically have the line breaks converted into spaces. Prevents errors in display of data e.g. in single line text boxes, and problems storing in python scripts (EOL error) etc.
  • Fixed bug where the first SQLite database in a project was assumed to be the default sofa database even though it might not be. It is now possible to link to multiple default databases e.g. testing copies, as long as a simple naming convention is followed.
  • SOFA now rolls back to the last good database connection if a connection fails.
  • Fixed strange bug where the default database would lock if you made a new table, then looked at the design, then tried to write to the database e.g. importing, editing data. Refreshing the cursor after updating the demo table design made the problem go away.
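
The line-break fix for multi-line values can be pictured with a short sketch like the one below. The helper name `flatten_line_breaks` is hypothetical, and collapsing the whitespace around each break into a single space is an assumption about the exact behaviour.

```python
import re


def flatten_line_breaks(text):
    """Replace each run of line breaks (and surrounding whitespace) with one space."""
    return re.sub(r"\s*[\r\n]+\s*", " ", text).strip()


# A label typed over two lines becomes safe for single-line text boxes
# and for storing inside a quoted string in a generated script.
print(flatten_line_breaks("Age\ngroup"))
```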

Further improvements in 1.1.4

Friday, February 24th, 2012

The latest version adds a range of improvements:

  • Added lower and upper quartiles to Row Stats report tables.

    Quartiles

  • Box plots now start y-axis from just below the minimum y value of the data unless the content is close enough to the bottom of the graph to make it worth using 0 anyway.
    Y axis adjusted automatically for box and whisker plots
  • Showing the percent sign in percent columns for report tables is now optional – which is good news for many dissertation students.

    Show (or hide) percentage symbols

  • SOFA now displays value labels sorted by the numerical version of numbers even if stored as text. So no more 1, 11, 2, 3 etc in cases where people have stored the number as a Text data type.
  • Added some more valid US date formats using dot dividers.
  • New help button for importing data.
  • New help button to advise on how to make use of flexible data filters.
  • English translations are handled better (no more messages about not having US English and using UK English instead etc).
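
The numeric ordering of text values mentioned in the list above can be sketched as a sort key. This is an illustration only; `numeric_sort_key` is a hypothetical name and placing non-numeric labels after the numeric ones is an assumption.

```python
def numeric_sort_key(value):
    """Sort numbers stored as text by numeric value; other text sorts after."""
    try:
        return (0, float(value), "")
    except (TypeError, ValueError):
        # Not a number: keep it, but order it alphabetically after the numbers.
        return (1, 0.0, str(value))


stored_as_text = ["1", "11", "2", "3"]
print(sorted(stored_as_text))                        # plain text order: 1, 11, 2, 3
print(sorted(stored_as_text, key=numeric_sort_key))  # numeric order: 1, 2, 3, 11
```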

Plus there are some useful bug fixes:

  • Fixed bug where getting observed values e.g. for chi square test, fell over when one field in pair had missing values while the other didn’t.
  • Fixed bug in calculation of upper and lower whiskers in box plots.
  • Single bar charts don’t show a bar title anymore – only needed if multichart.
  • Fixed bug which only changed variable definitions when the extra settings dialog was closed with OK and didn’t ever set it otherwise e.g. when changing the selected project.
  • Now copes with newer versions of matplotlib on Linux.
  • No longer stores empty strings as variable labels if user doesn’t enter a label.
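
For readers curious about what goes into quartiles and whiskers, here is a rough sketch. It assumes the common Tukey convention (whiskers extend to the most extreme data points within 1.5 times the interquartile range of the box) and the "inclusive" quartile method from Python's `statistics` module; the source does not say which conventions SOFA follows, and `box_plot_stats` is a hypothetical name.

```python
import statistics


def box_plot_stats(values, whisker_factor=1.5):
    """Return quartiles and whisker ends under the assumed Tukey convention."""
    data = sorted(values)
    lower_q, median, upper_q = statistics.quantiles(data, n=4, method="inclusive")
    iqr = upper_q - lower_q
    low_fence = lower_q - whisker_factor * iqr
    high_fence = upper_q + whisker_factor * iqr
    return {
        "lower_quartile": lower_q,
        "median": median,
        "upper_quartile": upper_q,
        # Whiskers stop at the most extreme values still inside the fences.
        "lower_whisker": min(v for v in data if v >= low_fence),
        "upper_whisker": max(v for v in data if v <= high_fence),
        "outliers": [v for v in data if v < low_fence or v > high_fence],
    }


print(box_plot_stats([1, 2, 3, 4, 5, 6, 7, 8, 9, 30]))
```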

Honey I Shrunk the Installers

Monday, December 19th, 2011

The SOFA installers for Windows and Mac have shrunk substantially – from 43MB to 25MB for Windows and from a rather hefty 85MB to 36MB for Mac. They’ll be quicker to download, and the new installers also avoid possible conflicts with other Python packages on a system. It’s all self-contained. A final benefit is that the installation process itself has become much simpler, with far fewer steps. For those who are technically minded, it is thanks to pyinstaller and py2app (with some initial help from Gui2exe).

Mainstream German Computer Magazine Reviews SOFA

Sunday, December 18th, 2011

SOFA has been reviewed and included in the software CD for a recent edition of Germany’s c’t magazine (c’t 2011 Issue 26 p.118). C’t (Magazin für Computertechnik) has a sold circulation of about 367,000 so it was wonderful to show up on their radar.

c't magazine cover

Better installation in non-English environments

Wednesday, November 23rd, 2011

Version 1.1.2 fixes a bug which affected people trying to install SOFA into many non-English environments. SOFA also has some changes which make it safe for SOFA to communicate progress in more detail while being run in Windows using the non-console version of Python. Overall, SOFA has become much more robust in recent versions.

Good news for Mac & Linux users – Excel importing added

Sunday, October 9th, 2011

SOFA Statistics 1.1.1 brings good news for Mac and Linux users. You can now import Excel xls files directly. This is no longer a Windows-only feature.

Here is the full list of changes:

  • Excel can be imported from Mac and Linux as well as Windows.
  • ODS importing now copes with single ‘divider’ columns – i.e. columns with no field name in the header.
  • CSV importing now autofills blank columns with field numbers such as Var018.
  • More informative messages if there are locale issues.
  • More informative messages if unable to connect to MySQL on Mac.
  • Changed standard deviation in report tables from population sd to sample sd.
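
The difference between the population and sample standard deviation is just the divisor (n versus n - 1). A small sketch using Python's standard `statistics` module, with made-up data, shows the effect:

```python
import statistics

# Made-up example values, not from any SOFA data set.
values = [2, 4, 4, 4, 5, 5, 7, 9]

population_sd = statistics.pstdev(values)  # divides squared deviations by n
sample_sd = statistics.stdev(values)       # divides squared deviations by n - 1

print(population_sd, sample_sd)  # the sample sd is slightly larger
```

The sample version is usually the appropriate one when the data are a sample used to estimate a wider population, which is the typical case for report tables.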

There is one important set of bug fixes which allows more sophisticated extraction of cell values from ODS spreadsheets. SOFA now copes with formatted content of cells and other complex cases by handling subelements in the XML.

Version 1.1.0 brings it together

Saturday, August 20th, 2011

Version 1.1.0 finally brings it together adding some of the last features to round out the original vision for the application. The main change is much easier access to data – users can now open data tables from anywhere the table can be selected e.g. charts, report tables, statistical analyses.

Open your data from anywhere

Another change makes it easier to import from spreadsheets – SOFA now gives a preview of the first few rows of data to make it easier to determine whether there is a header row or not:

Spreadsheet sample

The two extra changes are: Importing from Google Doc spreadsheets now automatically starts import process if downloading was successful; Windows users can install into any folder now, not just one with sofastats in the name.

There are also a couple of bug fixes: Fixed bug when trying to display feedback on resizing operation on data table from dialogs other than data select; and fixed regression when running data list report tables.

Here are all the major feature changes since version 1.0 was released:

  • Single line charts now have option of a trend line and data smoothing (weighted rolling average).

    New options for line charts

  • Averages can be displayed for most chart types e.g. a line chart of average income by month.

    Chart Averages

  • Attractive and dynamic Box and Whisker plots added.

    Box and Whisker Plot button

    Box and Whisker Plot

  • Much easier access to data – can now open data table from anywhere the table can be selected e.g. charts, report tables, stats analyses.
  • Numerous usability improvements and bug fixes.
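
As a rough illustration of the line chart options listed above, the sketch below computes a weighted rolling average and a least-squares trend line. The (1, 2, 1) weights, the handling of the end points, and the function names are all assumptions made for the example; SOFA's actual smoothing may differ.

```python
def weighted_rolling_average(values, weights=(1, 2, 1)):
    """Smooth values with a centred weighted window (weights are assumed)."""
    half = len(weights) // 2
    smoothed = []
    for i in range(len(values)):
        total = 0.0
        weight_sum = 0.0
        for offset, weight in enumerate(weights, start=-half):
            j = i + offset
            if 0 <= j < len(values):  # ignore positions beyond either end
                total += weight * values[j]
                weight_sum += weight
        smoothed.append(total / weight_sum)
    return smoothed


def linear_trend(values):
    """Least-squares slope and intercept, using 0, 1, 2, ... as x values.

    Needs at least two values.
    """
    n = len(values)
    mean_x = (n - 1) / 2
    mean_y = sum(values) / n
    sxx = sum((x - mean_x) ** 2 for x in range(n))
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in enumerate(values))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


monthly = [10, 14, 9, 15, 13, 18]  # made-up monthly figures
print(weighted_rolling_average(monthly))
print(linear_trend(monthly))
```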

I hope you really like it.

1.0.7 Much easier data entry; better support for non-English text

Thursday, July 28th, 2011

It is now a lot easier and more pleasant to enter data directly into SOFA. Check it out and see if you agree. It is also easier to get CSV data in if there are lots of fields. Overall this is an incremental step forwards rather than the introduction of lots of new features. Here is the full list of improvements:

  • Much easier and quicker data entry. Return key now functions like Tab in data entry tables. Deleting a cell automatically inserts the appropriate value.
  • Much faster importing of csv files with lots of fields. Now has option of quickly checking field names collectively (in batches under the surface) rather than individually.
  • Improved feedback to user if there is a problem in the early stages of starting SOFA. Program now makes an error text file on the user desktop as well.
  • All field or table name checks in SQLite now return the SQLite error text as well.
  • Better message to user if the wx backend for matplotlib is not installed.
  • If the user cancels the process of changing the file used to define variable config, the report table display no longer reverts to a random demo.

and bug fixes:

  • Fixed bug in chi square when no labels set for numerical variables. Needed to convert value to unicode before using as label.
  • Fixed bug when importing datetimes with ‘T’ as the separator between date and time.
  • Fixed bug caused by SQLite queries sometimes returning strings instead of floats when extracting REAL (numeric) data. Fixed it where it affected Row Stats medians and std devs; and statistical tests.
  • Fixed bug when uwhisker and lwhisker not set. Also copes better when no boxes are displayed in boxplot.
  • Handling Python 2.6 unicode keyword bug.
  • Replaced pprint.pformat where it messes up unicode e.g. user paths with non-ascii characters. Misc other changes to fix internal issues.
  • Fixed bug allowing None to be displayed in Val A and Val B drop-downs under Group by e.g. ANOVA.
  • Config dialog in Report Tables widened slightly when needed to display title.
  • Fixed bug when decimal entered into value label list for an integer field.
  • Fixed CSV import bug when trying to guess whether there is a header row or not.
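
To illustrate the datetime fix in the list above, here is a minimal sketch of accepting either a space or a ‘T’ between the date and the time. The two format strings and the name `parse_datetime` are assumptions for the example; SOFA accepts a wider range of formats than shown here.

```python
from datetime import datetime

# Hypothetical, deliberately short list of accepted layouts.
DATETIME_FORMATS = ("%Y-%m-%d %H:%M:%S", "%Y-%m-%dT%H:%M:%S")


def parse_datetime(text):
    """Try each known layout in turn and return the first successful parse."""
    for fmt in DATETIME_FORMATS:
        try:
            return datetime.strptime(text.strip(), fmt)
        except ValueError:
            continue
    raise ValueError("Unrecognised datetime: %r" % text)


print(parse_datetime("2011-07-28T14:30:00"))
print(parse_datetime("2011-07-28 14:30:00"))
```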

Thanks to all the users who helped identify and resolve problems.

50,000 downloads – another milestone

Monday, June 6th, 2011

SOFA Statistics had its 50,000th download today, and last month had over 4,600 downloads – a new record.

Download milestone

Box and Whisker Plots in version 1.0.6

Saturday, June 4th, 2011

Box and Whisker plots have been added.

Box and Whisker Plot button

In keeping with the SOFA ethos these have been made as attractive as possible:

Box and Whisker Plot

As with all SOFA output, the Box and Whisker Plots are themed and, like other charts, the content is dynamic and interactive. A minor feature added to this version is a warning about the need to include the “sofa_report_extras” subfolder to see charts.

The following bug fixes have been made:

  • Fixed bug preventing comma being used as decimal separator when importing data.
  • Fixed bug when making line charts with averages – shows smoothed data line and trend line appropriately. Also only enables or disables checkboxes for smoothed data line and trend line where appropriate.
  • Better handling of comtype errors.
  • Fixed bug in histogram code when no default limits supplied.
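
The decimal separator fix can be pictured with a small sketch like this one. It is only an illustration: `parse_number` is a hypothetical helper, and it deliberately ignores thousands separators, which a real importer would also need to consider.

```python
def parse_number(text, decimal_sep=","):
    """Convert text such as "3,14" to a float given the decimal separator."""
    cleaned = text.strip()
    if decimal_sep != ".":
        # Python's float() only understands a dot as the decimal mark.
        cleaned = cleaned.replace(decimal_sep, ".")
    return float(cleaned)


print(parse_number("3,14"))                  # comma used as decimal separator
print(parse_number("2.5", decimal_sep="."))  # dot used as decimal separator
```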