Project

General

Profile

Mod rewrite » History » Revision 48

Revision 47 (Shtirlic, 2009-03-10 16:22) → Revision 48/64 (dgt84, 2009-03-11 09:50)

h1. URL Rewrites 

 {{>toc}} 

 *Module: mod_rewrite* 

 h2. Description 

 internal redirects, url rewrite 

 *{color:red}NOTE: url rewriting does not work within a $HTTP["url"] conditional.* [http://forum.lighttpd.net/topic/1092#3028]  

 h2. Options 

 h3. url.rewrite-once 

 Rewrites a set of URLs internally in the webserver BEFORE they are handled. 
  
 e.g. 

 <pre> 
 url.rewrite-once = ( "<regex>" => "<relative-uri>" ) 
 </pre> 

 h3. url.rewrite-repeat 

 Rewrites a set of URLs internally in the webserver BEFORE they are handled 
  
 e.g. 

 <pre> 
 url.rewrite-repeat = ( "<regex>" => "<relative-uri>" ) 
 </pre> 

 The difference between these options is that, while url.rewrite-repeat allows for applying multiple (seperately defined) rewrite rules in a row, url.rewrite-once will cause further rewrite rules to be skipped if the expression was matched. As such, url.rewrite-once behaves like Apaches' RewriteRule ... [L]: http://httpd.apache.org/docs/2.2/mod/mod_rewrite.html#rewriterule 

 The options @url.rewrite@ and @url.rewrite-final@ were mapped to @url.rewrite-once@ in 1.3.16. 

 *{color:red}NOTE: url rewriting does not work within a $HTTP["url"] conditional.* [http://forum.lighttpd.net/topic/1092#3028]  

 h2. Regular Expressions 

 * Patterns ("wildcards") are matched against a string 
 * Special characters (see [http://www.regular-expressions.info/reference.html] for reference): 

 ** . (full stop) - match any character 
 ** \* (asterisk) - match zero or more of the previous symbol 
 ** \+ (plus) - match one or more of the previous symbol 
 ** ? (question) - match zero or one of the previous symbol 
 ** \\? (backslash-something) - match special characters 
 ** ^ (caret) - match the start of a string 
 ** $ (dollar) - match the end of a string 
 ** [set] - match any one of the symbols inside the square braces. 
 ** [^set] - match any symbol that is NOT inside the square braces. 
 ** (pattern) - grouping, remember what the pattern matched as a special variable  
 ** {n,m} - from n to m times matching the previous character (m could be omitted to mean >=n times) 
 ** (?!expression) - match anything BUT expression at the current position. Example: @"^(/(?!(favicon.ico$|js/|images/)).*)" => "/fgci/$1"@ 

 * Normal alphanumeric characters are treated as normal 

 h3. Replacement Patterns 

 If the matched regex contains groups in parentheses, $1..$9 in the replacement refer to the captured text in the  
 matching group "$1" meaning the first group, "$2" the second, and so on. 

 Note that % replacements (like %1, %2, %0, etc.) in url.rewrite-* targets are permitted, but do *not* have the meaning they would have in evhost.path-pattern.    If url.rewrite-* is specified within a regex conditional, % patterns are replaced by the corresponding groups from the condition regex.    %1 is replaced with the first subexpression, %2 with the second, etc.    %0 is replaced by the entire substring matching the regexp.    See below for an example using "%0". 

 h2. Examples 

 The regex is matching the full REQUEST_URI which is supplied by the user including  

 query-string. 

 <pre> 
 # the following example, is, however just simulating vhost by rewrite 
 # * you can never change document-root by mod_rewrite 
 # use mod_*host instead to make real mass-vhost 

 server.document-root = "/www/htdocs/" 
 $HTTP["host"] =~ "^.*\.([^.]+\.com)$" { 
   url.rewrite-once = ( "^/(.*)" => "/%0/$1" ) 
 } 

 # request:          http://any.domain.com/url/  
 # before rewrite: REQUEST_URI="/www/htdocs/url/" 
 # and DOCUMENT_ROOT="/www/htdocs/" %0="any.domain.com" $1="url/" 
 # after rewrite:    REQUEST_URI="/www/htdocs/any.domain.com/url/" 
 # still, you have DOCUMENT_ROOT=/www/htdocs/ 

 # please note, that we have two regular expressions: the one which  
 # $HTTP["host"] is been compared with, and the one of the rewrite rule. 
 # the numbered subexpressions available to build the relative uri are 
 # being prefixed by '%' for subexpressions of the first regular expression  
 # match and by '$' for subexpressions of the second one. 
 # subexpression 0 interpolates the whole matching string: %0 for the whole 
 # string matching the conditional, and $0 for the whole string matching the 
 # rewrite rule. 

 # if the rewrite rule is not included in a conditional  
 # block, only the '$' prefixed variables are available. 

 url.rewrite-once = ( "^/id/([0-9]+)$" => "/index.php?id=$1", 
                      "^/link/([a-zA-Z]+)" => "/index.php?link=$1" ) 

 </pre> 

 h3. With mod_redirect 

 Rewrite rules always execute before redirect rules. This is true regardless of the order of module loading or the order of rules in the configuration (lighttpd v1.4.13). However, mod_rewrite provides a mechanism to pass URLs through unmangled: specify "$0" as the rule target. 

 e.g. 

 <pre> 
 url.rewrite-once = ( 
     "^/foo"    => "$0", 
     "^/(.*)" => "/handler/$1" 
 ) 

 url.redirect = ( 
     "^/foo"    => "http://foo.bar/" 
 ) 
 </pre> 

 h3. Workaround for "File name too long" on Windows 

 While running Lighttpd on Windows you may get @500 Internal Server Error@ if computed filename is longer than 255 symbols. 
 In error log it will be @(response.c.537) file not found ... or so:    File name too long /very_looooong_path ->@. 
 As workaround you can use @mod_rewrite@ to avoid this error. 

 <pre> 
 server.modules += ("mod_rewrite") 
 url.rewrite-once = ( ".{250,}" => "/toolong.php" ) 
 </pre> 

 If error handler is PHP, @$_SERVER['REQUEST_URI']@ will contain full URI. 

 h3. Passing / Matching the Query string (GET variables) 

 If you wanna pass the Query String (?foo=bar) to the rewrite destination you have to explicitly match it: 

 <pre> 
 url.rewrite-once = ( 
     "^/news/([^\?]+)(\?(.*))?" => "/news.php?title=$1&$3" 
 ) 
 </pre>