Current mobile devices feature such tech specs, one could only dream about a few years ago. They provide great work optimization opportunities. Despite that, many of them are used in business scenarios only occasionally. This article uncovers the utilization of mobile device's built-in camera and microphone in a photo documentation application that enables creating a voice tag. Such application has numerous ways of usage. Practically, it is suitable for any kind of work force collecting data in the field. A good example is a Field Service solution, where a service technician fills out a report when being in the field. In such situation, the photo documentation describing a technical defect is an essential part of the report. Another example can be a policeman or an insurance agent that uses mobile application for documentation of a car accident or an insurance claim.Last but not least, the solution can be hosted, for instance, by a mobile operator that can include it in their portfolio of services adding interesting value for their customers. Let's develop the application in Microsoft Visual Studio and .NET Compact Framework. In order to avoid complicated technology of picture and sound processing, we will use components from the Resco MobileForms Toolkit. These components dramatically simplify the overall development and save lots of time. After running a quick brainstorming, this is the expected functionality our application shall possess: