CQPweb User Page
Latest news
  • Version 3.3.17, 2022-09-02

    New feature: sort and uniq options added to the tabulation output control.

  • Version 3.3.16, 2022-07-22

    Concordance Download now has an option to include the categories of a Categorised Query.

    New system for handling compressed files implemented, and used to make the Export Corpus tool work better.

  • Version 3.3.15, 2022-05-04

    Extended Corpus Installer plugins to make it possible for customised questions to be asked of the user.

  • Version 3.3.14, 2022-04-27

    Fixed multiple bugs in the Lexical Growth Curve tool (among other bits and pieces).

  • Version 3.3.13, 2022-04-07

    Added pagination buttons to extended context view.

    Refined mechanism for right-to-left text display, so that it works properly to display aligned parallel data where one corpus is right-to-left and the other isn't.

    Added buttons for sequential access to concordance extended context (“previous” and “next”).

    Added five new colour schemes (palettes chosen by Isolde van Dorst).

    Fixed a bug in the HTML cache.

    Fixed a bug that prevented people accessing corpora shared by their colleaguates.

  • Version 3.3.12, 2021-12-23

    Added an HTML cache for improved performance.

    Fixed many bugs in the plugin system for users’ own corpora, making it actually fit for general use for the first time.

  • Version 3.3.11, 2021-11-09

    Enabled use of “data-” attributes on <span> tags in XML visualisations.

  • Version 3.3.10, 2021-10-14

    Implemented the 'TAB Optimisation' in the CEQL Parser, which makes certain types of query run up to 10 times faster (see https://www.linguistik.phil.fau.de/2021/07/18/tab-optimisation-cqpweb/).

    Fixed many, many bugs. These include: a bug in the display of the Query History for queries run in unsaved subcorpora; a bug in concordance download affecting download of metadata columns; a bug affecting the display of Corpus Installer plugins; a bug affecting AutoSave in categorised queries; and assorted others.

  • Version 3.3.9, 2021-02-10

    Improved subcorpus and restriction management.

  • Version 3.3.8, 2021-02-06

    Made the Restriction Cache Control render more quickly.

    Fixed a whole lot of bugs arising from new features in the last two versions.

  • Version 3.3.7, 2020-07-13

    Fixed a longstanding bug in extended context display alternative view where spaces in the alternative attribute caused words to disappear. As a result, the latest version of CWB (3.4.26) is now required.

    New feature: it's now possible to request arbitrary annotations to be included within the concordance/extended context.

    New feature: it's now possible to create a subcorpus by direct upload of corpus position interval data.

    Made it possible to download files from upload areas.

    Improved the framework for running user-written CQPweb Apps for specialised tasks.

    Implemented revised CEQL improvements (in both the built-in PHP module and the external Perl implementation). As a consequence, installations that use the Perl CEQL parser now require the most recent version of CWB-Perl.

    Fixed bugs in user corpus setup; query merging; matrix creation; and elsewhere.

    Did a whole lot of cleanup and reorganisation.

    Added new function, run_query(), to the API. This is the core of the API, although many data access functions remain.

  • Version 3.3.6, 2020-04-24

    Added two new query modification tools: expand and narrow.

    Expand wraps around the CQP “expand to s-attribute” command for modifying queries.

    Narrow changes a multiword result to a single token per hit (either the original leftmost position or the original rightmost position).

    Categorised query concordances now autosave changes to categories (by default; can be turned off in user settings, or in the categorise-mode concordance).

    Made distribution run faster in queries with many, many texts.

    Fixed some major bugs in the wordcloud view of keywords (plus an assortment of minor bugs).
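
    As a rough illustration of what the narrow tool in this release does to a result set, here is a minimal sketch in Python. It is not CQPweb's own code: the function name narrow, the keep parameter and the representation of hits as (start, end) pairs of corpus positions are all assumptions made for the example.

```python
from typing import List, Tuple

# A hit is assumed to be an inclusive (start, end) pair of corpus positions.
Hit = Tuple[int, int]


def narrow(hits: List[Hit], keep: str = "left") -> List[Hit]:
    """Reduce each (possibly multiword) hit to a single token.

    keep="left"  keeps the original leftmost position of each hit;
    keep="right" keeps the original rightmost position.
    """
    if keep not in ("left", "right"):
        raise ValueError("keep must be 'left' or 'right'")
    if keep == "left":
        return [(start, start) for start, _end in hits]
    return [(end, end) for _start, end in hits]


# Hypothetical example data: one three-token hit and one single-token hit.
example_hits = [(10, 12), (40, 40)]
left_narrowed = narrow(example_hits, keep="left")
right_narrowed = narrow(example_hits, keep="right")
```

    The number of hits is unchanged; only the extent of each match shrinks to one token.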

  • Version 3.3.5, 2020-04-15

    New feature: users can now change the email address associated with their account.

    New user setting: preferred keywords statistic (Log Ratio by default).

    Fixed bug in query-combination; also fixed bugs in admin view of users' Query History.

  • Version 3.3.4, 2020-04-02

    Bug fixes to internals (mostly table naming) and improvements to data structures for different kinds of saved data.

    Some tweaks to the keywords interface.

  • Version 3.3.3, 2020-03-31

    Improvements to the categorised query tools.

    Bugfixes galore.

  • Version 3.3.2, 2020-03-27

    Gave the collocation and distribution interfaces a bit of an overhaul.

    Yet more bugfixes.

  • Version 3.3.1, 2020-03-15

    Added a tool to the admin interface that checks for should-have-been-deleted uncompressed CWB attribute files (deleting these can help save disk space).

    Added pagination to feature matrix display.

    Made it possible to build a feature matrix for texts in a particular subcorpus, rather than the whole corpus.

    Added a new kind of template: catdesc (“category-description”) templates. These are sets of category handles with associated descriptions that can be applied to any classification-type metadata field.

    Added a new keyboard shortcut to the query entry box: pressing ALT+Q now cycles through the query modes (ALT+SHIFT+Q does the same the other way).

    Plus a big wave of crucial bugfixes for the 3.3.0 upgrade.

  • Version 3.3.0, 2020-03-12

    Major upgrade: changed the database encoding to support full UTF-8 (including emoji characters) via MySQL/MariaDB's "utf8mb4" encoding, which was not available when CQPweb was initially invented.

    Also, set up a system to use the best available collation (compatible with recent versions of MySQL and MariaDB) for all stored textual data.

    Improved support for corpora in right-to-left scripts.

    Removed unneeded backwards compatibility code. This means that CQPweb henceforth requires AT LEAST v7.3 of PHP, and AT LEAST v 3.4.21 of the CWB core.

    Removed dependency on the tar and gzip programs (internal PHP functions used instead), although indexing corpora from .gz still relies on gzip (because cwb-encode uses pipes for this, rather than Zlib functions).

    Made use of the awk, sort and head tools optional, ending the Unix dependency.

    Added a "rendering cache" to simplify preservation of view-settings across different screens.

    Added a subcorpus creation method that simply converts a saved query's matches to the intervals that make up the subcorpus.

    Made it possible for annotations (p-attributes) to have their default case-sensitivity and accent-sensitivity set independently (this can also be set for annotation templates).

    Annotations, and annotation templates, can now also specify on a per-annotation basis whether a frequency table should be built for that annotation (necessary for collocations and keywords, but actually a bad thing for some kinds of p-attribute data).

    In consequence, the frequency-table management tools for corpora have been made rather more sophisticated.

    Added many more functions to the external API.

    Added a system for annotation templates to supply information for pre-indexed CWB corpora. Added a parallel system for XML templates!

    Revamped the internal debug-message system for maximum utility and convenience.

    Reworked the colour scheme system completely; added five new colour schemes using the new system.

    Gave a few other aspects of the user interface a proverbial "lick of paint".

    Added experimental query optimisation using CQP TAB queries to the CEQL parser.

    Added plugin activation for query download plugins (and, eventually, others).

    Added tool to view SQL daemon configuration within admin control panel.

  • Versions 3.2.42 and later versions of 3.2 contain only fixes for bugs, many back-ported from 3.3.

  • Version 3.2.41, 2020-01-09

    Finalised the (so far undocumented) “run CQPweb apps” tool.

    Improved memory usage in subcorpus manipulation.

    Fixed a ton of bugs, including some major issues in distribution tables.

  • Version 3.2.40, 2019-09-17

    Extended and streamlined the distribution tool to do a larger range of operations based on XML.

    Did some subcorpus-management optimisation.

  • Version 3.2.39, 2019-07-04

    Added a command-line script to install a corpus.

  • Version 3.2.38, 2019-06-12

    Revamped corpus access permissions for keywords, to make matching up a greater range of datasets possible.

    Made the new, internal CEQL parser into the default (so CWB-Perl is no longer necessary to run CQPweb).

    Began integration work on the new Dispersion tool (code contributed by Andressa Gomide).

    Fixed multiple bugs.

  • Version 3.2.37, 2019-06-07

    Standardised the back-end data formats; also fixed some problems with one of the plugins.

  • Version 3.2.36, 2019-05-31

    Added the "embed-page" tool as per feature request.

    Added a tool for administrators to view users' upload areas.

    Fixed multiple bugs.

  • Version 3.2.35, 2019-05-27

    Added the "lock password" tool for administrators to stop users changing their passwords.

    Added a button for administrators to reset all of a user's preferences to default values.

    Fixed multiple bugs.

  • Version 3.2.34, 2019-05-07

    Added the "colleaguation" networking system for data sharing (experimental only).

  • Version 3.2.33, 2019-04-24

    Many, many bugfixes from the previous release.

    Updated the required version of PHP from 5.x to 7.0 (the latter itself being out of date but still present in Debian).

    Made improvements to the CQP class making it easier to set system options, change registry, change data directory, etc.

  • Version 3.2.32, 2019-04-15

    Added a long-requested major new feature: users can now install their own corpora.

    Revamped the plugin system from the ground up, and added administrative tools for managing plugins, as well as some built-in demo plugins.

    Added wordcloud-view to the keywords tool.

    Added new built-in corpus statistic: 1,000 token Standardised TTR.

    Added a new collocation statistic: conservative estimate of Log Ratio (credit: Stefan Evert). Also added this statistic to the keywords tool.

    Changed the default collocation statistic and the default keywords statistic to Log Ratio.

    Added tool to manipulate Saved Queries by merging two existing queries into one.

    Added a built-in CEQL parser, allowing the dependency on Perl to be ended.

    Added additional, more complex methods for defining variables for multivariate analysis.

    Added programmatic access to CQPweb via an API (which has, so far, only a few unexciting functions; but more is on the way!)

    Added better tools for measuring disk usage for database tables, as well as a UI to optimise tables that are wasting a lot of disk space.

    Added an internal replacement for the now-very-old wztooltip.js library for rendering tips.

    Changed the required version of the CWB core to 3.4.10 or later.

    Fixed an edge-case bug related to the collation of handles in various table fields.

    Ended support for Internet Explorer; the browsers for which bugs will be accepted are henceforth Edge, Chrom[e,ium], Safari, and Firefox.
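
    For readers unfamiliar with the 1,000 token Standardised TTR statistic mentioned in this entry, the usual idea is to average the type-token ratio over consecutive fixed-size chunks of text. The sketch below shows that idea only; it assumes that an incomplete final chunk is simply discarded, which may not match what CQPweb does, and the function name standardised_ttr is made up for the example.

```python
from typing import List, Optional


def standardised_ttr(tokens: List[str], chunk_size: int = 1000) -> Optional[float]:
    """Mean type-token ratio over consecutive chunks of chunk_size tokens.

    Assumption: a trailing chunk shorter than chunk_size is ignored.
    Returns None when the text is too short to fill even one chunk.
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    full_chunks = len(tokens) // chunk_size
    if full_chunks == 0:
        return None
    ratios = []
    for i in range(full_chunks):
        chunk = tokens[i * chunk_size:(i + 1) * chunk_size]
        # Types = distinct word forms in the chunk; tokens = chunk_size.
        ratios.append(len(set(chunk)) / chunk_size)
    return sum(ratios) / len(ratios)


# Tiny demonstration with a chunk size of 2 instead of 1,000:
# chunks are [a, b] (TTR 1.0) and [a, a] (TTR 0.5); the leftover "c" is dropped.
demo_sttr = standardised_ttr(["a", "b", "a", "a", "c"], chunk_size=2)
```

    Averaging over equal-sized chunks is what makes the figure comparable across texts of different lengths, which a plain TTR is not.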

  • Version 3.2.31, 2017-10-04

    Completed the feature that adds new data to a corpus without re-indexing it (this can now be done for p-attributes as well as s-attributes and corpus metadata).

  • Version 3.2.30, 2017-10-04

    Added an alternative method to insert new p/s-attributes: via rescanning the corpus registry file.

  • Version 3.2.29, 2017-09-29

    Added a new method of corpus export: separate text files within a zip archive.

  • Version 3.2.28, 2017-09-28

    Added a way to add new data to a corpus without re-indexing it (s-attributes and corpus metadata).

  • Version 3.2.27, 2017-09-25

    Revamped the Distribution system to allow for non-text-based Distribution statistics.

    Fixed a longstanding and embarrassing potential security hole (non-use of CQP QueryLock for user queries).

    Added Lexical Growth graph-drawing function as first of an anticipated series of data visualisation tools.

    Added a system for XML visualisation templates.

    Added an option for users to get frequency-list downloads in AntConc-compatible format.

    Added a control to the search form allowing the CQP match-strategy to be set.

    Tweaked some database structures in an attempt to optimise.

    Fixed a CEQL bug with queries of the form {LEMMA/TAG}: the fallback to searching secondary plus tertiary annotations was not working; now it is.

    Fixed multiple other bugs, including a potential data-shadow in the Restriction cache.

  • Version 3.2.26, 2016-12-23

    Added access to CQP binary files for privileged users (for archiving, etc. purposes).

    Added administrative tools for creating and editing user privileges.

    Added additional admin tools for controlling user login sessions, as well as some extra security measures to the change-password mechanism.

  • Version 3.2.25, 2016-12-15

    Added a new feature (yay!): experimental lexical growth curve analysis system.

    Fixed a bug that stopped XML values containing spaces rendering correctly in the concordance.

    Fixed a number of bugs in the storage of subcorpora containing very many texts selected individually or using the scan text metadata tool.

    Added a tool in the administrator's interface to delete the entire query history (thus, setting the usage statistics back to empty values).

    Reorganised the text metadata management screen.

    Improved error messages in the save-query system.

    Improved the gizmos for monitoring disk usage: query cache control now distinguishes user-saved data from deletable cache data, user profile view says how much disk space they are using for saved/categorised queries.

    (v 3.2.24 was a partial way-point towards 3.2.25; it gets no separate entry.)

  • Version 3.2.23, 2016-08-08

    Made parallel-corpus view work properly in categorise-query mode.

    Made the limit on the size of file a user can upload configurable via the privilege system.

    Fixed one or two bugs.

  • Version 3.2.22, 2016-07-31

    Added support for alignment attributes: display of one parallel corpus matching-region is possible in concordance/context view, inclusion of multiple parallel corpus matching-regions is possible in concordance download.

  • Version 3.2.21, 2016-07-04

    Fixed a critical bug in the code calculating confidence intervals for keywords and lockwords.

  • Version 3.2.20, 2016-07-01

    Added XML visualisation in concordance download.

  • Version 3.2.19, 2016-06-29

    Added the ability to create subcorpora from sub-text regions containing a query hit.

    Added metadata view for XML-based ID-link metadata (e.g. speaker metadata in spoken corpora).

  • Version 3.2.18, 2016-06-25

    Reimplemented the XML visualisation system.

    Made breaking paragraphs after punctuation in extended context view optional, rather than always implemented.

    Allowed "extra code files" (JS/CSS) to be added to enhance the visualisations.

  • Version 3.2.17, 2016-06-03

    Fixed a bug affecting the installation of pre-indexed corpora.

    Fixed a critical bug in the Frequency Breakdown that was getting "words to the Left" and "words to the Right" the wrong way round.

    Cleaned up the user management interface a bit.

  • Version 3.2.16, 2016-06-02

    Fixed a small but critical bug that stopped the system running on PHP version 7.

  • Version 3.2.15, 2016-05-31

    Tweaked the user-permissions lookup system to make pages load faster.

    Fixed some miscellaneous bugs, including one in the log-on system.

  • Version 3.2.14, 2016-04-04

    Added the “export corpus” function.

  • Version 3.2.13, 2016-04-03

    Fixed the opcode cache monitor to work with newer versions of PHP.

    Added a cache monitor for stray temporary tables.

    Made some background tweaks to speed up performance.

  • Version 3.2.12, 2016-03-23

    Added a readout of the amount of disk space used by each corpus, and a database-cache monitor.

    Made it possible to add readable descriptions for XML idlink categories.

    Added a tool to allow the administrator to upgrade the database format to InnoDB.

    Added “switched off” mode for use during database upgrades etc.

  • Version 3.2.11, 2016-03-21

    This is a (hopefully) stable version, prior to some upcoming extensive changes in 3.2.12.

    Fixed a lingering bug in the upgrade process.

    Fixed an edge-case bug in the collocation function.

    Reorganised the superuser's cache-control functions for the different types of cached data.

  • Version 3.2.10, 2016-03-06

    Added restricted-query data caching to improve performance.

    Addressed a number of other performance-related issues.

  • Version 3.2.9, 2016-03-02

    Multiple bug fixes for the previously-added subcorpus/restricted query features.

  • Version 3.2.8, 2016-02-19

    More big internal reorganisation.

    Added query restriction by conditions on corpus XML.

    Added, likewise, subcorpus creation by conditions on corpus XML.

    (v 3.2.7 was a partial way-point towards 3.2.8; it gets no separate entry.)

  • Version 3.2.6, 2016-02-06

    More effort to rework the internals for XML support.

    By popular demand: added back the "create batch of accounts" tool in the admin interface.

    Added "alternative view" to extended context, allowing historical corpora to show original spelling.

    Improved the interface for managing user accounts a bit.

  • Version 3.2.5, 2016-01-23

    Fixed numerous minor bugs.

    Made the tabulation-download system able to access s-attributes.

    Reworked some of the internals as a stepping stone to (yet more) XML support.

  • Version 3.2.4, 2015-11-19

    Finished (for now) the XML metadata management functions.

    Added plain-text download of feature matrices.

    Added monochrome view to assist visually-impaired accessibility.

    Rewrote the Distribution function for full BNCweb-style functionality (plus a fix to the broken sort buttons).

    Fixed a potential security bug in the signup and persistent-login systems.

    Fixed multiple minor bugs from the big 3.2 upgrade.

    Fixed a bug affecting thinned queries, and another affecting collocate highlighting in concordances.

  • Version 3.2.3, 2015-10-15

    Added new feature: restricted access to corpora.

    Metadata templates now work properly.

    Added list of CWB attributes to query forms (displays when "CQP syntax" is selected).

    Fixed more bugs.

  • Version 3.2.2, 2015-10-09

    Fixed a cluster of bugs, including some critical, in the corpus setup process.

    Added metadata templates (some functionality incomplete).

    And finally: added two new colour schemes just for the hell of it (there's more to life than databases, tha knows).

  • Version 3.2.1, 2015-10-07

    Reorganised the "manage metadata" functions.

    Added management functions (not yet entirely complete!) for XML metadata.

    Added potted explanations of collocation statistics to the collocation screen.

    Fixed a serious bug in the Distribution / Restricted Query functions, and a minor bug in the keywords download format for Log Ratio.

  • Version 3.2.0, 2015-10-01

    Reorganised the architecture in preparation for some new features.

    Added new and better user account management tools in the admin control panel.

    Rewrote indexing of s-attributes to support improved XML metadata features (upcoming!)

    Added new feature of XML templates: pre-set patterns of XML elements/attributes.

    Indexing of a corpus can now be done using a template for XML.

  • Version 3.1.16, 2015-06-08

    Fixed a number of bugs in the frequency-list generation functions.

    Fixed a bug causing an inordinate number of warning messages to be printed.

    Improved the display of factor analysis output.

  • Version 3.1.15, 2015-03-30

    Added embedded-image (or, embedded webpage) functions for text metadata.

  • Version 3.1.14, 2015-03-25

    Drastically improved the "cache control" and "monitor MySQL" interfaces in the Admin control panel.

  • Version 3.1.13, 2015-01-30

    Fixed a critical bug (namespace clash with PHP's "intl" module).

  • Version 3.1.12, 2015-01-09

    Wrote new help system: YouTube tutorial videos are now directly embedded in the "Help" pages.

  • Version 3.1.11, 2014-11-18

    Bug fix update (bugs affecting frequency lists, error reporting, and other fairly dull stuff).

  • Version 3.1.10, 2014-09-03

    Some minor performance tweaks and improved error reporting in the background.

    Updated the bug report screen.

  • Version 3.1.9, 2014-06-20

    Added new feature: factor analysis of a feature matrix derived from saved queries.

    Reorganised the JavaScript code to allow for a more user-friendly interface.

  • Version 3.1.8, 2014-06-16

    Added new feature: adjust individual user permissions on creation of frequency lists for subcorpora.

    Updated frequency breakdown to enable breakdown of any concordance position within 5 tokens (as well as the node).

    Fixed keyword download bug (no confidence interval printout).

    Fixed critical cache leak bug.

    Improved cache control display.

  • Version 3.1.7, 2014-04-28

    Reorganised the keyword output screen to give a richer analysis.

    Added experimental effect-size statistics for keyness: Log Ratio unfiltered, and Log Ratio with LL or CI Filter.

    Added tool to extract lockwords using Log Ratio.

    Added the same Log Ratio with LL filter to the Collocations tool.

    Fixed a bug affecting collocation/sorting done on uploaded queries.
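
    Log Ratio, the effect-size statistic introduced in this release, is usually described as the binary logarithm of the ratio of two relative frequencies. The following is a minimal sketch of that calculation, not CQPweb's implementation: the function name and parameters are illustrative, and substituting 0.5 for a zero frequency is an assumption made here so that the ratio is always defined.

```python
import math


def log_ratio(freq_a: float, size_a: int, freq_b: float, size_b: int,
              zero_substitute: float = 0.5) -> float:
    """Binary log of the ratio of relative frequencies in corpus A vs corpus B.

    Assumption: a frequency of zero is replaced by zero_substitute
    to avoid division by zero or log of zero.
    """
    if size_a <= 0 or size_b <= 0:
        raise ValueError("corpus sizes must be positive")
    fa = freq_a if freq_a > 0 else zero_substitute
    fb = freq_b if freq_b > 0 else zero_substitute
    relative_a = fa / size_a
    relative_b = fb / size_b
    return math.log2(relative_a / relative_b)


# A word twice as frequent (per token) in corpus A as in corpus B
# has a Log Ratio of about 1; four times as frequent gives about 2.
doubled = log_ratio(200, 1_000_000, 100, 1_000_000)
quadrupled = log_ratio(400, 1_000_000, 100, 1_000_000)
```

    Each additional point of Log Ratio therefore corresponds to a doubling of the difference in relative frequency; the LL and CI filters mentioned above are separate significance checks and are not shown here.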

  • Version 3.1.6, 2014-04-11

    Fixed a bug affecting collocations in subcorpora of very large corpora and making the process take hours to run.

  • Version 3.1.5, 2014-03-31

    Added annotation templates, and interface for controlling them.

    Added basic cache control mechanism to admin interface.

    Added query-page link to YouTube video tutorials.

  • Version 3.1.4, 2014-02-11

    Added CAPTCHA to account-creation process.

  • Version 3.1.3, 2014-02-03

    Added bulk-add function for assigning users to groups en masse.

  • Version 3.1.2, 2014-01-31

    Gave the admin control panel a spring-clean, and added a facility to monitor the PHP opcode cache. Plus more bug fixes!

  • Version 3.1.1, 2014-01-20

    Fixes for the inevitable bugs following a large update.

  • Version 3.1.0, 2014-01-20

    Revamped user account system.

    Added a script to automatically upgrade an existing CQPweb MySQL database to match a more recent version of the code.

    Added a script to import user groups from the old system.

    Added a script to import group privileges from the old system.

    Fixed bug affecting use of XML tags in CEQL queries.

    Rewrote configuration file format and added documentation to system administrator's manual.

    Many other miscellaneous tweaks, improvements and architectural changes.

  • Version 3.0.16, 2013-12-24

    Fixed two minor bugs in the concordance download function.

  • Version 3.0.15, 2013-11-20

    Improved background handling of frequency lists (no changes a user would notice).

  • Version 3.0.14, 2013-11-18

    Added protection against users compiling very, very large frequency tables for subcorpora or on-the-fly for collocations.

  • Version 3.0.13, 2013-11-04

    Implemented context-width restrictions for limited-license corpora.

  • Version 3.0.12, 2013-11-02

    Updated database template for newer MySQL servers.

  • Version 3.0.11, 2013-08-30

    New feature: non-classification metadata fields can now be included in a concordance-download.

  • Version 3.0.10, 2013-04-22

    Added some extra protection against possible XSS (cross-site-scripting) attacks.

  • Version 3.0.9, 2013-04-06

    Added a new feature: queries can now be downloaded as "tabulations".

  • Version 3.0.8, 2013-03-22

    Added a debugging backtrace to the error messages seen by superusers.

    Added Yates' continuity correction to the calculation of Z-score in the Collocation function.

    The usual miscellaneous bug fixes, including one affecting character encoding.
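
    To show what a Yates continuity correction does to a Z-score, here is a small sketch. It assumes the simple form z = (O - E) / sqrt(E), where O is the observed and E the expected co-occurrence frequency; the exact formula used inside CQPweb's Collocation function may differ, and the function name z_score is hypothetical.

```python
import math


def z_score(observed: float, expected: float, yates: bool = True) -> float:
    """Simple z-score for a collocate, optionally with Yates' correction.

    With the correction, the absolute difference |O - E| is reduced by 0.5
    (but never below zero) before dividing by sqrt(E).
    """
    if expected <= 0:
        raise ValueError("expected frequency must be positive")
    diff = observed - expected
    if yates:
        shrunk = max(abs(diff) - 0.5, 0.0)
        diff = math.copysign(shrunk, diff)
    return diff / math.sqrt(expected)


uncorrected = z_score(10, 4, yates=False)  # (10 - 4) / 2
corrected = z_score(10, 4, yates=True)     # (6 - 0.5) / 2
```

    The correction always pulls the score towards zero, which matters most when the frequencies involved are small.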

  • Version 3.0.7, 2013-03-19

    Fixed a bug affecting creation of batches of user accounts.

    Fixed a bug causing the number of hits in a categorised query to be displayed incorrectly.

    Fixed a bug causing insertion of line-breaks into queries with long lines.

    Fixed an inconsistency in how batches of usernames are created.

    Fixed a bug in the management of user groups, plus a bug affecting the installation of corpora that are not in UTF-8.

    Fixed a bug in the install/delete corpus procedures which made deletion of a corpus difficult if its installation had previously failed halfway through.

  • Version 3.0.6, 2012-05-15

    More bug fixes.

    Added a new feature: a full file-by-file distribution table can now be downloaded.

    Adjusted the Distribution interface to make it more like the Collocations interface.

  • Version 3.0.5, 2012-02-19

    Just bug fixes, but major ones!

  • Version 3.0.4, 2012-02-10

    New feature: optional position labels in concordance (just like "sentence numbers" in BNCweb) (this feature originally planned for 3.0.3 but not complete in that release).

    Extended the XML visualisation system to allow conditional visualisations (ditto).

    XML visualisations now actually appear in the concordance (but only partially rendered: they look like raw XML).

  • Version 3.0.3, 2012-02-05

    Mostly a boring bug-fix release, with only one new feature: users can now customise their default thin-mode setting.

    Fixed a bug in concordance download function that was scrambling links to context.

    Fixed a bug in categorisation system that allowed invalid category names to be created.

    Fixed a bug in frequency list creation that introduced forms in the wrong character set into the database.

    Fixed a bug in the keyword function's frequency table lookup process.

  • Version 3.0.2, 2011-08-28

    Added the long-awaited "upload user's own query" function.

    Finished the administrator's management of XML visualisations. Coming next, implementation in concordance view.

    Made it possible for a user to have the same saved-query name in two different corpora.

    Fixed a bug that made non-reproducible random thinning actually always reproducible!

  • Version 3.0.1, 2011-08-20

    Implemented a better system for sorting corpora into categories on the homepage.

    Fixed a fairly nasty bug that was blocking corpus indexing.

    Fixed an uninformative error message when textual restrictions are selected that no texts actually match (zero-sized section of the corpus). The new error message explains the issue more clearly.

  • Version 3.0.0, 2011-07-18

    New feature: custom postprocess plugins!

    Fixed some bugs in unused parts of the CQP interface.

    Added support for all ISO-8859 character sets.

    Version number bumped to 3.0.0 to match new CWB versioning rules, though CQPweb is in fact now compatible with the post-Unicode versions of CWB (3.2.0+).

  • Version 2.17, 2011-05-18

    Fixed a fairly critical (and very silly) bug that was blocking compression of indexed corpora.

    Added extra significance-threshold options for keywords analysis.

  • Version 2.16, 2011-03-08

    Added a workaround for a problem that arises with some MySQL security setups.

    Added an optional RSS feed of system messages, and made links in system messages display correctly both within webpages and in the RSS feed.

    Created a storage location for executable command-line scripts that perform offline administration tasks (in a stroke of unparalleled originality, I call it "bin").

    Added customisable headers and logos to the homepage (a default CWB logo is supplied).

    Fixed a bug in right-to-left corpora (Arabic etc.) where collocations were referred to as being "to the right" or "to the left" of the node even though this was wrong by about 180 degrees.

  • Version 2.15, 2010-12-02

    Licence switched from GPLv3+ to GPLv2+ to match the rest of CWB. Some source files remain to be updated!

    A framework for "plugins" (semi-freestanding programlets) has been added. Three types of plugins are envisaged: transliterator plugins, annotator plugins, and format-checker plugins. Some "default" plugins will be supplied later.

    Some tweaks have been made to the concordance download options, in particular, giving a new default download style (“field-data-style”).

    For the administrator, there is a new group-access-cloning function.

    The required version of CWB has been dropped back down to a late v2, but you still need 3.2.x if you want UTF-8 regular expression matching to work properly in all languages.

    Improvements to query cache management internals.

    Plus the usual bug fixes, including some that deal with security issues, and further work on the R interface.

  • Version 2.14, 2010-08-27

    Quite a few new features this time. First, finer control over concordance display has been added; if you have the data, you can now have concordance results rendered as three-line-examples (field data or typology style with interlinear glosses).

    The R interface is ready for use with this version, although it is not actually used anywhere yet, and additional interface methods will be added as the need for them becomes evident. It goes without saying that you need R installed in order to do anything with this.

    The new Web API has been established, and the first two functions "query" and "concordance" created. Documentation for the Web API is still on the to-do list, and it's not quite ready for use...

    Plus, a new function for creating snapshots of the system (useful for making backups); a "diagnostic" interface for checking out common problems in setting up CQP (incomplete as yet); and some improvements to the documentation for system administrators.

    Also added a new subcorpus creation function which makes one subcorpus for every text in the corpus.

  • Version 2.13, 2010-05-31

    Increased required version of CWB to 3.2.0 (which has Unicode regular expression matching). This means that regular expression wildcards will work properly with non-Latin alphabets.

    Also added a function to create an "inverted" subcorpus (one that contains all the texts in the corpus except those in a specified existing subcorpus).

    Plus, as ever, more bug fixes and usability tweaks.
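
    The "inverted" subcorpus described in this entry amounts to a set difference over text identifiers. The sketch below illustrates that idea only; the function name invert_subcorpus and the use of plain lists of text IDs are assumptions for the example, not a description of how CQPweb stores subcorpora.

```python
from typing import List


def invert_subcorpus(all_text_ids: List[str], subcorpus_text_ids: List[str]) -> List[str]:
    """Return every text in the corpus that is NOT in the given subcorpus.

    The original corpus order of the text IDs is preserved.
    """
    excluded = set(subcorpus_text_ids)
    unknown = excluded - set(all_text_ids)
    if unknown:
        raise ValueError(f"texts not in corpus: {sorted(unknown)}")
    return [text_id for text_id in all_text_ids if text_id not in excluded]


# Hypothetical text IDs for illustration.
corpus_texts = ["A01", "A02", "B01", "B02"]
inverted = invert_subcorpus(corpus_texts, ["A02", "B02"])
```

    Inverting the result again against the same corpus gives back the original subcorpus.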

  • Version 2.12, 2010-03-19

    Added first version of XML visualisation.

    Also made distribution tables sortable on frequency or category handle (latter remains the default).

    Also added support for CQP macros and for configurable context width in concordances (including xml-based context width as well as word-based context width).

    Plus many bug fixes and minor tweaks.

  • Version 2.11, 2010-01-20

    First release of 2010! CQPweb is now two years old.

    Added improved group access management, and a setting allowing corpora to be processed in a case-sensitive way throughout (not recommended in general, but potentially useful for some languages e.g. German).
    Also added a big red warning that pops up when a user types an invalid character in a "letters-and-numbers-only" entry on a form.
    Plus lots of bug fixes.

  • Version 2.10, 2009-12-18

    Added customisable mapping tables for use with CEQL tertiary-annotations.

  • Version 2.09, 2009-12-13

    New metadata-importing functions and other improvements to the internals of CQPweb.

  • Version 2.08, 2009-11-27

    Updated internal database-query interaction. As a result, CQPweb requires CWB version 2.2.101 or later.
    Other changes (mostly behind-the-scenes): enabled Latin-1 corpora; accelerated concordance display by caching number of texts in a query in the database; plus assorted bug fixes.

  • Version 2.07, 2009-09-08

    Fixed a bug in context display affecting untagged corpora.

  • Version 2.07, 2009-08-07

    Enabled frequency-list comparison; fixed a bug in the sort function and another in the corpus setup procedure.

  • Version 2.06, 2009-07-27

    Added distribution-thin postprocessing function.

  • Version 2.05, 2009-07-26

    Added frequency-list-thin postprocessing function.

  • Version 2.04, 2009-07-05

    Bug fixes (thanks to Rob Malouf for spotting the bugs in question!) plus improvements to CQP interface object model.

  • Version 2.03, 2009-06-18

    Added interface to install pre-indexed CWB corpus and made further tweaks to admin functions.

  • Version 2.02, 2009-06-06

    Fixed some minor bugs, added categorised corpus display to main page, added option to sort frequency lists alphabetically.

  • Version 2.01, 2009-05-27

    Added advanced subcorpus editing tools. All the most frequently-used BNCweb functionality is now replicated.

  • Version 1.26, 2009-05-25

    Added Categorise Query function.

  • Version 1.25, 2009-04-05

    Added Word lookup function.

  • Version 1.24, 2009-03-18

    Added concordance sorting.

  • Version 1.23, 2009-03-01

    Minor updates to admin functions.

  • Version 1.22, 2009-01-20

    Added support for right-to-left scripts (e.g. Arabic).

  • Version 1.21, 2009-01-06

    Added (a) concordance downloads and (b) concordance thinning function.

  • Version 1.20, 2008-12-19

    Added (a) improved concordance Frequency Breakdown function and (b) downloadable concordance tables.

  • Version 1.19, 2008-11-24

    New-style simple queries are now in place! This means that "lemma-tags" will now work for most corpora.

  • Version 1.18, 2008-11-20

    The last bits of the Collocation function have been added in. Full BNCweb-style functionality is now available. The next upgrade will be to the new version of CEQL.

  • Version 1.17, 2008-11-12

    Links have been added to collocates in collocation display, leading to full statistics for each collocate (plus position breakdown).

  • Version 1.16, 2008-10-23

    Concordance random-order button has now been activated.

  • Version 1.15, 2008-10-11

    A range of bugs have been fixed.
    New feature: a link to “corpus and tagset help” in the middle of the footer on every page.

  • Version 1.14, 2008-09-16

    Not much change that the user would notice, but the admin functions have been completely overhauled.
    The main user-noticeable change is that UTF-8 simple queries are now possible.

  • Version 1.13, 2008-08-04

    Added collocation concordances (i.e. concordances of X collocating with Y).
    Also added system-messages function.

  • Version 1.12, 2008-07-27

    Upgrades made to database structure to speed up collocations and keywords.

  • Version 1.11, 2008-07-25

    Added improved user options database.

  • Version 1.10, 2008-07-13

    Added frequency list view function, plus download capability for keywords and frequency lists.

  • Version 1.09, 2008-07-03

    Added keywords, made fixes to frequency lists.

  • Version 1.08, 2008-06-27

    Added collocations (now with full functionality). Added frequency list support for subcorpora.

  • Version 1.07, 2008-06-10

    Added collocations function (beta version only).

  • Version 1.06, 2008-06-07

    Minor (but urgent) fixes to the system as a result of changes to MySQL database structure.

  • Version 1.05, 2008-05-23

    Added subcorpus functionality (not yet as extensive as BNCweb's).

  • Version 1.04, 2008-02-04

    Added restricted queries, and successfully trialled the system on a 4M-word corpus.

  • Version 1.03, 2008-01-23

    Added distribution function.

  • Version 1.02, 2008-01-08

    Added save-query function and assorted cache management features for sysadmin.

  • Version 1.01, 2008-01-06

    First version of CQPweb with fully working concordance function, cache management, CSS layouts, metadata view capability and basic admin functions (including username control) -- trial release with small test corpus only.

  • Autumn 2007.

    Development of core PHP scripts, the CQP interface object model and the MySQL database architecture.