SEARCH

From iMacros
Revision as of 14:19, 7 March 2014 by Td (Talk | contribs)

(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search

The SEARCH commands works with page source, instead of looking at the web page object model (DOM) that the TAG command uses. Thus SEARCH also "sees" e. g. comments in the page source code.

Syntax iMacros Browser IE Plug-in Firefox

SEARCH SOURCE=(TXT|REGEXP) IGNORE_CASE=YES EXTRACT=$1

Notes

  1. The default is IGNORE_CASE=NO
  2. Extracted information is always of type text, as the SEARCH command works on the page source code. $1 refers to the Regular Expressions syntax. So the use of EXTRACT requires the use of the REGEX attribute.
  3. The REGEXP attribute works with standard Regular Expressions (often called regex or regexp).
  4. Global, iterative matching is currently not supported, so only the first match on the page can be found and extracted.
  5. iMacros Browser and iMacros for IE use the .NET regexp engine and since the backslash is an escape sequence by itself, its occurrence has to be escaped as well. For example: to specify \S in your regular expression, you should use \\S.
  6. iMacros for Firefox uses the Javascript regexp engine, which does not support lookbehind notation: "(?<=ABC)".

Examples

Test macro: This macro checks for the presence of the Google Analytics script on our website and then extracts the ID of it.

URL GOTO=http://imacros.net/
'Check for GA Object
SEARCH SOURCE=TXT:"'GoogleAnalyticsObject'"
'Check for GA Object, ignore case
SEARCH SOURCE=TXT:"'googleanalyticsobject'" IGNORE_CASE=YES
'Use regular expression to extract data
SEARCH SOURCE=REGEXP:"ga\\('create', '([^']+)', 'imacros.net'\\)" EXTRACT="Tracker is $1"
'Display extracted data
PROMPT {{!EXTRACT}}

Note: REGEXP:"ga\\('create', '([^']+)', 'imacros.net'\\)" means:

  • find the word ga followed by an opening brace \\( and the first parameter value 'create' followed by a comma and the start of the next parameter
  • ( starts a capturing group
  • [')]+ followed by anything except a single quote [^'] repeated one or more times +
  • ) ends the capturing group
  • ' and ending with a single quote
  • => it extracts the second argument of the ga command (the text between the '...'), which is our tracker id
  • The result is saved in $1, which is then assigned to the !EXTRACT variable

Another example: Search for a script. Escape " with \"

 SEARCH SOURCE=TXT:"<script src=\"/js/datevalidator.js\""

Related Forum Posts

See Also