Stars: 368
Forks: 30
Pull Requests: 55
Issues: 54
Watchers: 15
Last Updated: 2023-01-14 09:22:30
Advanced shortcode (BBCode) parser and engine for PHP
License: MIT License
Languages: PHP, Makefile, Dockerfile
Shortcode is a framework agnostic PHP library allowing to find, extract and process text fragments called "shortcodes" or "BBCodes". Examples of their usual syntax and usage are shown below:
[user-profile /]
[image width=600]
[link href="http://google.pl" color=red]
[quote="Thunderer"]This is a quote.[/quote]
[text color="red"]This is a text.[/text]
The library is divided into several parts, each of them containing logic responsible for different stages and ways of processing data:
Each part is described in the dedicated section in this document.
There are no required dependencies and all PHP versions from 5.3 up to latest 8.1 are tested and supported. This library is available on Composer/Packagist as thunderer/shortcode
, to install it execute:
composer require thunderer/shortcode=^0.7
or manually update your composer.json
with:
(...)
"require": {
"thunderer/shortcode": "^0.7"
}
(...)
and run composer install
or composer update
afterwards. If you're not using Composer, download sources from GitHub and load them as required. But really, please use Composer.
To ease usage of this library there is a class ShortcodeFacade
configured for most common needs. It contains shortcut methods for all features described in the sections below:
addHandler()
: adds shortcode handlers,addHandlerAlias()
: adds shortcode handler alias,process()
: processes text and replaces shortcodes,parse()
: parses text into shortcodes,setParser()
: changes processor's parser,addEventHandler()
: adds event handler,serialize()
: serializes shortcode object to given format,unserialize()
: creates shortcode object from serialized input.Shortcodes are processed using Processor
which requires a parser and handlers. The example below shows how to implement an example that greets the person with name passed as an argument:
use Thunder\Shortcode\HandlerContainer\HandlerContainer;
use Thunder\Shortcode\Parser\RegularParser;
use Thunder\Shortcode\Processor\Processor;
use Thunder\Shortcode\Shortcode\ShortcodeInterface;
$handlers = new HandlerContainer();
$handlers->add('hello', function(ShortcodeInterface $s) {
return sprintf('Hello, %s!', $s->getParameter('name'));
});
$processor = new Processor(new RegularParser(), $handlers);
$text = '
<div class="user">[hello name="Thomas"]</div>
<p>Your shortcodes are very good, keep it up!</p>
<div class="user">[hello name="Peter"]</div>
';
echo $processor->process($text);
Facade example:
use Thunder\Shortcode\ShortcodeFacade;
use Thunder\Shortcode\Shortcode\ShortcodeInterface;
$facade = new ShortcodeFacade();
$facade->addHandler('hello', function(ShortcodeInterface $s) {
return sprintf('Hello, %s!', $s->getParameter('name'));
});
$text = '
<div class="user">[hello name="Thomas"]</div>
<p>Your shortcodes are very good, keep it up!</p>
<div class="user">[hello name="Peter"]</div>
';
echo $facade->process($text);
Both result in:
<div class="user">Hello, Thomas!</div>
<p>Your shortcodes are very good, keep it up!</p>
<div class="user">Hello, Peter!</div>
Processor
has several configuration options available as with*()
methods which return the new, changed instance to keep the object immutable.
withRecursionDepth($depth)
controls the nesting level - how many levels of shortcodes are actually processed. If this limit is reached, all shortcodes deeper than level are ignored. If the $depth
value is null (default value), nesting level is not checked, if it's zero then nesting is disabled (only topmost shortcodes are processed). Any integer greater than zero sets the nesting level limit,withMaxIterations($iterations)
controls the number of iterations that the source text is processed in. This means that source text is processed internally that number of times until the limit was reached or there are no shortcodes left. If the $iterations
parameter value is null, there is no iterations limit, any integer greater than zero sets the limit. Defaults to one iteration,withAutoProcessContent($flag)
controls automatic processing of shortcode's content before calling its handler. If the $flag
parameter is true
then handler receives shortcode with already processed content, if false
then handler must process nested shortcodes itself (or leave them for the remaining iterations). This is turned on by default,withEventContainer($events)
registers event container which provides handlers for all the events fired at various stages of processing text. Read more about events in the section dedicated to them.If processor was configured with events container there are several possibilities to alter the way shortcodes are processed:
Events::FILTER_SHORTCODES
uses FilterShortcodesEvent
class. It receives current parent shortcode and array of shortcodes from parser. Its purpose is to allow modifying that array before processing them,Events::REPLACE_SHORTCODES
uses ReplaceShortcodesEvent
class and receives the parent shortcode, currently processed text, and array of replacements. It can alter the way shortcodes handlers results are applied to the source text. If none of the listeners set the result, the default method is used.There are several ready to use event handlers in the Thunder\Shortcode\EventHandler
namespace:
FilterRawEventHandler
implements FilterShortcodesEvent
and allows to implement any number of "raw" shortcodes whose content is not processed,ReplaceJoinEventHandler
implements ReplaceShortcodesEvent
and provides the mechanism to apply shortcode replacements by discarding text and returning just replacements.The example below shows how to manually implement a [raw]
shortcode that returns its verbatim content without calling any handler for nested shortcodes:
use Thunder\Shortcode\Event\FilterShortcodesEvent;
use Thunder\Shortcode\EventContainer\EventContainer;
use Thunder\Shortcode\Events;
use Thunder\Shortcode\HandlerContainer\HandlerContainer;
use Thunder\Shortcode\Parser\RegularParser;
use Thunder\Shortcode\Processor\Processor;
use Thunder\Shortcode\Shortcode\ShortcodeInterface;
$handlers = new HandlerContainer();
$handlers->add('raw', function(ShortcodeInterface $s) { return $s->getContent(); });
$handlers->add('n', function(ShortcodeInterface $s) { return $s->getName(); });
$handlers->add('c', function(ShortcodeInterface $s) { return $s->getContent(); });
$events = new EventContainer();
$events->addListener(Events::FILTER_SHORTCODES, function(FilterShortcodesEvent $event) {
$parent = $event->getParent();
if($parent && ($parent->getName() === 'raw' || $parent->hasAncestor('raw'))) {
$event->setShortcodes(array());
}
});
$processor = new Processor(new RegularParser(), $handlers);
$processor = $processor->withEventContainer($events);
assert(' [n /] [c]cnt[/c] ' === $processor->process('[raw] [n /] [c]cnt[/c] [/raw]'));
assert('n true [n /] ' === $processor->process('[n /] [c]true[/c] [raw] [n /] [/raw]'));
Facade example:
use Thunder\Shortcode\Event\FilterShortcodesEvent;
use Thunder\Shortcode\Events;
use Thunder\Shortcode\Shortcode\ShortcodeInterface;
use Thunder\Shortcode\ShortcodeFacade;
$facade = new ShortcodeFacade();
$facade->addHandler('raw', function(ShortcodeInterface $s) { return $s->getContent(); });
$facade->addHandler('n', function(ShortcodeInterface $s) { return $s->getName(); });
$facade->addHandler('c', function(ShortcodeInterface $s) { return $s->getContent(); });
$facade->addEventHandler(Events::FILTER_SHORTCODES, function(FilterShortcodesEvent $event) {
$parent = $event->getParent();
if($parent && ($parent->getName() === 'raw' || $parent->hasAncestor('raw'))) {
$event->setShortcodes(array());
}
});
assert(' [n /] [c]cnt[/c] ' === $facade->process('[raw] [n /] [c]cnt[/c] [/raw]'));
assert('n true [n /] ' === $facade->process('[n /] [c]true[/c] [raw] [n /] [/raw]'));
This section discusses available shortcode parsers. Regardless of the parser that you will choose, remember that:
-
, basically must conform to the [a-zA-Z0-9-]+
regular expression,[code]content[/codex]
) will be ignored, opening tag will be interpreted as self-closing shortcode, eg. [code /]
,[code]content[inner][/code]content[/inner]
) will be interpreted as self-closing, eg. [code]content[inner /][/code]
, second closing tag will be ignored,There are three included parsers in this library:
RegularParser
is the most powerful and correct parser available in this library. It contains the actual parser designed to handle all the issues with shortcodes like proper nesting or detecting invalid shortcode syntax. It is slightly slower than regex-based parser described below,RegexParser
uses a handcrafted regular expression dedicated to handle shortcode syntax as much as regex engine allows. It is fastest among the parsers included in this library, but it can't handle nesting properly, which means that nested shortcodes with the same name are also considered overlapping - (assume that shortcode [c]
returns its content) string [c]x[c]y[/c]z[/c]
will be interpreted as xyz[/c]
(first closing tag was matched to first opening tag). This can be solved by aliasing handler name, because for example [c]x[d]y[/d]z[/c]
will be processed correctly,WordpressParser
contains code copied from the latest currently available WordPress (4.3.1). It is also a regex-based parser, but the included regular expression is quite weak, it for example won't support BBCode syntax ([name="param" /]
). This parser by default supports the shortcode name rule, but can break it when created with one of the named constructors (createFromHandlers()
or createFromNames()
) that change its behavior to catch only configured names. All of it is intentional to keep the compatibility with what WordPress is capable of if you need that compatibility.All parsers (except WordpressParser
) support configurable shortcode syntax which can be configured by passing SyntaxInterface
object as the first parameter. There is a convenience class CommonSyntax
that contains default syntax. Usage is shown in the examples below:
use Thunder\Shortcode\HandlerContainer\HandlerContainer;
use Thunder\Shortcode\Parser\RegexParser;
use Thunder\Shortcode\Parser\RegularParser;
use Thunder\Shortcode\Processor\Processor;
use Thunder\Shortcode\Shortcode\ShortcodeInterface;
use Thunder\Shortcode\Syntax\CommonSyntax;
use Thunder\Shortcode\Syntax\Syntax;
use Thunder\Shortcode\Syntax\SyntaxBuilder;
$builder = new SyntaxBuilder();
Default syntax (called "common" in this library):
$defaultSyntax = new Syntax(); // without any arguments it defaults to common syntax
$defaultSyntax = new CommonSyntax(); // convenience class
$defaultSyntax = new Syntax('[', ']', '/', '=', '"'); // created explicitly
$defaultSyntax = $builder->getSyntax(); // builder defaults to common syntax
Syntax with doubled tokens:
$doubleSyntax = new Syntax('[[', ']]', '//', '==', '""');
$doubleSyntax = $builder // actually using builder
->setOpeningTag('[[')
->setClosingTag(']]')
->setClosingTagMarker('//')
->setParameterValueSeparator('==')
->setParameterValueDelimiter('""')
->getSyntax();
Something entirely different just to show the possibilities:
$differentSyntax = new Syntax('@', '#', '!', '&', '~');
Verify that each syntax works properly:
$handlers = new HandlerContainer();
$handlers->add('up', function(ShortcodeInterface $s) {
return strtoupper($s->getContent());
});
$defaultRegex = new Processor(new RegexParser($defaultSyntax), $handlers);
$doubleRegex = new Processor(new RegexParser($doubleSyntax), $handlers);
$differentRegular = new Processor(new RegularParser($differentSyntax), $handlers);
assert('a STRING z' === $defaultRegex->process('a [up]string[/up] z'));
assert('a STRING z' === $doubleRegex->process('a [[up]]string[[//up]] z'));
assert('a STRING z' === $differentRegular->process('a @up#string@!up# z'));
This library supports several (un)serialization formats - XML, YAML, JSON and Text. Examples below shows how to both serialize and unserialize the same shortcode in each format:
use Thunder\Shortcode\Serializer\JsonSerializer;
use Thunder\Shortcode\Serializer\TextSerializer;
use Thunder\Shortcode\Serializer\XmlSerializer;
use Thunder\Shortcode\Serializer\YamlSerializer;
use Thunder\Shortcode\Shortcode\Shortcode;
$shortcode = new Shortcode('quote', array('name' => 'Thomas'), 'This is a quote!');
Text:
$text = '[quote name=Thomas]This is a quote![/quote]';
$textSerializer = new TextSerializer();
$serializedText = $textSerializer->serialize($shortcode);
assert($text === $serializedText);
$unserializedFromText = $textSerializer->unserialize($serializedText);
assert($unserializedFromText->getName() === $shortcode->getName());
JSON:
$json = '{"name":"quote","parameters":{"name":"Thomas"},"content":"This is a quote!","bbCode":null}';
$jsonSerializer = new JsonSerializer();
$serializedJson = $jsonSerializer->serialize($shortcode);
assert($json === $serializedJson);
$unserializedFromJson = $jsonSerializer->unserialize($serializedJson);
assert($unserializedFromJson->getName() === $shortcode->getName());
YAML:
$yaml = "name: quote
parameters:
name: Thomas
content: 'This is a quote!'
bbCode: null
";
$yamlSerializer = new YamlSerializer();
$serializedYaml = $yamlSerializer->serialize($shortcode);
assert($yaml === $serializedYaml);
$unserializedFromYaml = $yamlSerializer->unserialize($serializedYaml);
assert($unserializedFromYaml->getName() === $shortcode->getName());
XML:
$xml = '<?xml version="1.0" encoding="UTF-8"?>
<shortcode name="quote">
<bbCode/>
<parameters>
<parameter name="name"><![CDATA[Thomas]]></parameter>
</parameters>
<content><![CDATA[This is a quote!]]></content>
</shortcode>
';
$xmlSerializer = new XmlSerializer();
$serializedXml = $xmlSerializer->serialize($shortcode);
assert($xml === $serializedXml);
$unserializedFromXml = $xmlSerializer->unserialize($serializedXml);
assert($unserializedFromXml->getName() === $shortcode->getName());
Facade also supports serialization in all available formats:
use Thunder\Shortcode\Shortcode\Shortcode;
use Thunder\Shortcode\ShortcodeFacade;
$facade = new ShortcodeFacade();
$shortcode = new Shortcode('name', array('arg' => 'val'), 'content', 'bbCode');
$text = $facade->serialize($shortcode, 'text');
$textShortcode = $facade->unserialize($text, 'text');
assert($shortcode->getName() === $textShortcode->getName());
$json = $facade->serialize($shortcode, 'json');
$jsonShortcode = $facade->unserialize($json, 'json');
assert($shortcode->getName() === $jsonShortcode->getName());
$yaml = $facade->serialize($shortcode, 'yaml');
$yamlShortcode = $facade->unserialize($yaml, 'yaml');
assert($shortcode->getName() === $yamlShortcode->getName());
$xml = $facade->serialize($shortcode, 'xml');
$xmlShortcode = $facade->unserialize($xml, 'xml');
assert($shortcode->getName() === $xmlShortcode->getName());
There are several builtin shortcode handlers available in Thunder\Shortcode\Handler
namespace. Description below assumes that given handler was registered with xyz
name:
NameHandler
always returns shortcode's name. [xyz arg=val]content[/xyz]
becomes xyz
,ContentHandler
always returns shortcode's content. It discards its opening and closing tag. [xyz]code[/xyz]
becomes code
,RawHandler
returns unprocessed shortcode content. Its behavior is different than FilterRawEventHandler
because if content auto processing is turned on, then nested shortcodes handlers were called, just their result was discarded,NullHandler
completely removes shortcode with all nested shortcodes,DeclareHandler
allows to dynamically create shortcode handler with name as first parameter that will also replace all placeholders in text passed as arguments. Example: [declare xyz]Your age is %age%.[/declare]
created handler for shortcode xyz
and when used like [xyz age=18]
the result is Your age is 18.
,EmailHandler
replaces the email address or shortcode content as clickable mailto:
link:
[xyz="[email protected]" /]
becomes <a href="[email protected]">[email protected]</a>
,[xyz][email protected][/xyz]
becomes <a href="[email protected]">[email protected]</a>
,[xyz="[email protected]"]Contact me![/xyz]
becomes <a href="[email protected]">Contact me!</a>
,PlaceholderHandler
replaces all placeholders in shortcode's content with values of passed arguments. [xyz year=1970]News from year %year%.[/xyz]
becomes News from year 1970.
,SerializerHandler
replaces shortcode with its serialized value using serializer passed as an argument in class' constructor. If configured with JsonSerializer
, [xyz /]
becomes {"name":"json", "arguments": [], "content": null, "bbCode": null}
. This could be useful for debugging your shortcodes,UrlHandler
replaces its content with a clickable link:
[xyz]http://example.com[/xyz]
becomes <a href="http://example.com">http://example.com</a>
,[xyz="http://example.com"]Visit my site![/xyz]
becomes <a href="http://example.com">Visit my site!</a>
,WrapHandler
allows to specify the value that should be placed before and after shortcode content. If configured with <strong>
and </strong>
, the text [xyz]Bold text.[/xyz]
becomes <strong>Bold text.</strong>
.Want to contribute? Perfect! Submit an issue or Pull Request and explain what would you like to see in this library.
See LICENSE file in the main directory of this library.