Quantcast
Channel: Intel® Software - Intel® VTune™ Profiler (Intel® VTune™ Amplifier)
Viewing all 1574 articles
Browse latest View live

Remote Linux port and personal shared libraries not showing in results

$
0
0

Hello, I've started using Intel VTune yesterday to profile the program I've been working on.

I installed it on my Windows 10 host machine and have been connecting remotely to the Linux Docker where the program is running.

- The first issue I have is that I want to use port 2222 to connect to the Linux Docker, like I do when connecting manually via SSH to that same Linux Docker. For that, we use ''-p 127.0.0.1:2222:22 ^'' in the ''docker run'' command.

I specified ''root@127.0.0.1:2222'' as the SSH destination but the only way it works is that I had to specifiy ''-p 127.0.0.1:22:22 ^'' in the ''docker run'' command.

- The second issue I have is that once I ran one profiling session of a few seconds, I expect to see results that take into account the personal shared libraries I'm using and that contain code that I know is executed.

However, what I see is that over tens of seconds of elapsed time, the CPU time is only 0.010s or such low values.

I added the following the BuildSettings.cmake file:

set(CMAKE_BUILD_TYPE Debug)
set(CMAKE_XCODE_ATTRIBUTE_DEBUG_INFORMATION_FORMAT "dwarf")

and when compiling, the -g appears but VTune is still now seeing any of my personal libraries.

Thank you !


Question on loads, stores and LLC miss count in Memory Access

$
0
0

Hi, all.

I have been trying to profile memory access of an application using VTune.

And I have the following questions about loads, stores and LLC miss count.

 

1. Does load and store count represents loads/stores that occurred in LLC only? or does it counts every single load and store in L1, L2 and LLC?

 

2. I think LLC miss count should be the same with the DRAM access count, but what I got is DRAM access count is larger than LLC miss count. What would make this situation?

 

I attached the images for each question. 

Thanks for answering in advance.

AttachmentSize
Downloadimage/jpegls.JPG16.81 KB
Downloadimage/jpegLLC.JPG11.8 KB

Can I see the result something like timeline-view in VTune?

$
0
0

Hi, all.

 

The attached image is from nvprof. 

I am wondering if I can see the profiling result like the attached image.

So I could see what function is used at certain time with timeline-view.

 

Thanks for answering in advance.

AttachmentSize
Downloadimage/pngtimeline-view.png54.1 KB

Have problem to run VTune Amplifer

$
0
0

I have a Fotran program uses LAPACK library in intel MKL. The program compiles and runs very well in MS Visual Studio. When I tried to analyze the program using "Vtune in Tools->  Intel Vtune Amplifier->Profle with VTune Amplifier" and start to run the program. It shows the following error message:

"The procedure entry point mkl_serv_inspector_suppress could not be located in the dynamic link library  ****" 

See attached image for error.

Can you help to fixe the issue?

Many Thanks!

 

AttachmentSize
Downloadimage/pngCapture.PNG8.9 KB

Failed to connect to VTune Amplifier data provider

$
0
0

I get the following error message when trying to load old results:

However, after deleting the old project and creating a new one, everything works as it should. I just can not close VTune Amplifier and then open it again to continue working on an old project.

127.0.0.1 is indeed excluded from my local proxy settings, and I have no local firewall running on this machine. Not sure about any additional firewalls on behalf of my company though.

Operating system is Linux, Opensuse 42.3

How can I install VTune on vLab with sampling driver?

$
0
0

Hi, all.

I have been trying to install VTune on Intel vLab.

However, it said that root permission is needed to install sampling driver and sampling driver will not be installed since it is a virtual system.

 

Is there any possible way to install VTune with sampling driver on vLab?

Can I collect data in multiple mode?

$
0
0

I'm interested in performance related to vectorization and memory access of my application, and I'm quite newbie in VTune.

 

I use two commands,

amplxe-cl -collect hpc-performance

amplxe-cl -collect memory-access

 

And, the result of each (r@@@hpc, r@@@macc directories are made) has its detailed data that the other doesn't have. But I want both. So, I'm wondering if I can get the result of both of them with one command.

Question about analyzing threading efficiency

$
0
0

Hi, I've been recently profiling PyTorch code using AVX instructions on my 16 core CPU. I'm feeling weird about the results. And I don't know one metric's meaning. Let me share my google drive link to show you my result.

https://drive.google.com/open?id=1dwy_DA6e6M9f9ruvOaR7-yFzOVvBD7jP

If 'H/W Context' is meaning physical core, is the 'VTune Result 6' image file saying that only 9 cores are working after 16000s? It's quite weird and I don't know why it happened. This kind of phenomenon happens when I increases training epoch in my python codes. (It means, increases iterating number of loop. The image files numbered 1~5 are for code with epochs 5, 15, 25, 50, 50. I tried the code of 50 epochs twice)


vtune_amplifier_target_sep_x86.tgz where to download?

$
0
0

Hi, I'm trying to install the vtune_amplifier_target_sep_x86.tgz package. Can someone please point me to where I can download the tar file? The automatic installation has failed. 

 

Thanks, 

Asha

Vtune not able to identify processor type

$
0
0

Hi, Vtune when trying to find the hotspots is not able to identify the processor type. Althought its a MAC OS with i5 installed. Can anyone post a solution for the issue? 

Thanks,

Asha

Basic questions about vtune

$
0
0

Does VTune work with using perf?

Can Vtune work on hardware which has not an x86 intel CPU inside, e.g. an embedded Samsung CPU?

I use the analyzer checking tool

Intel(R) VTune(TM) Amplifier Self Check Utility
Copyright (C) 2009-2019 Intel Corporation. All rights reserved.
Build Number: 602217

Instrumentation based analysis check
Example of analysis types: Hotspots with default knob sampling-mode=sw, Threading with default knob sampling-and-waits=sw
    Collection: Ok
    Finalization: Ok
amplxe: Warning: Cannot locate debugging information for file `/opt/intel/vtune_amplifier_2019.6.0.602217/lib64/libtpsstool.so'.
    Report: Ok

HW event-based analysis check (Perf)
Example of analysis types: Hotspots with knob sampling-mode=hw, HPC Performance Characterization, etc.
    Collection: Ok
amplxe: Warning: Access to /proc/kallsyms file is limited. Consider changing /proc/sys/kernel/kptr_restrict to 0 to enable resolution of OS kernel and kernel modules symbols.
amplxe: Warning: To profile kernel modules during the session, make sure they are available in the /lib/modules/kernel_version/ location.
    Finalization: Ok
    Report: Ok

HW event-based analysis check (Perf)
Example of analysis types: Microarchitecture Exploration
    Collection: Ok
amplxe: Warning: Access to /proc/kallsyms file is limited. Consider changing /proc/sys/kernel/kptr_restrict to 0 to enable resolution of OS kernel and kernel modules symbols.
amplxe: Warning: To profile kernel modules during the session, make sure they are available in the /lib/modules/kernel_version/ location.
    Finalization: Ok
    Report: Ok

HW event-based analysis with uncore events (Perf)
Example of analysis types: Memory Access
    Collection: Ok
amplxe: Warning: Access to /proc/kallsyms file is limited. Consider changing /proc/sys/kernel/kptr_restrict to 0 to enable resolution of OS kernel and kernel modules symbols.
amplxe: Warning: To profile kernel modules during the session, make sure they are available in the /lib/modules/kernel_version/ location.
    Finalization: Ok
    Report: Ok

HW event-based analysis with stacks (Perf)
Example of analysis types: Hotspots with knob sampling-mode=hw and knob enable-stack-collection=true, etc.
    Collection: Ok
amplxe: Warning: Access to /proc/kallsyms file is limited. Consider changing /proc/sys/kernel/kptr_restrict to 0 to enable resolution of OS kernel and kernel modules symbols.
amplxe: Warning: To profile kernel modules during the session, make sure they are available in the /lib/modules/kernel_version/ location.
    Finalization: Ok
amplxe: Warning: Cannot locate debugging information for file `/lib/x86_64-linux-gnu/libgcc_s.so.1'.
    Report: Ok

HW event-based analysis with context switches (Perf)
Example of analysis types: Threading with knob sampling-and-waits=hw
    Collection: Ok
amplxe: Warning: For analyses using the Perf-based driverless collection, the preemption and synchronization context switches may not be differentiated on kernels older than 4.17. To identify the context switch types on such kernels, switch to the driver-based collection by setting the Stack size option to the unlimited (0) value.
amplxe: Warning: Access to /proc/kallsyms file is limited. Consider changing /proc/sys/kernel/kptr_restrict to 0 to enable resolution of OS kernel and kernel modules symbols.
amplxe: Warning: To profile kernel modules during the session, make sure they are available in the /lib/modules/kernel_version/ location.
    Finalization: Ok
    Report: Ok

The system is ready to be used for performance analysis with Intel VTune Amplifier.
Review warnings in the output above to find product limitations, if any.

But I had to adjust the value in the file perf_event_paranoid to 0

Don't know really what this is needed for, but just found it in a previous run as output:

Please set the /proc/sys/kernel/perf_event_paranoid value to 0 or less to continue without installing the drivers.

 

how to analysis special lines choosed by the user?

$
0
0

For example, Vtune will analysis all codes and then draw CPU Utilization Histogram. But I only want vtune to analysis special code lines, such as 1000th-2000th code lines. How to do this?

Visual Studio 2017 Project not supported

$
0
0

Hi

I am using Visual Studio 2017 (vc 14.1).

For some VS projects, VTune amplifier works well but for the other, it doesn't.

When I try to 'Profile with VTune Amplifier' in VS IDE, it says 'The selected type of Visual Studio project is not supported by VTune Amplifier'.

Any advice for this error?

My project configuration is as follows:

<?xml version="1.0" encoding="utf-8"?>
<Project DefaultTargets="Build" ToolsVersion="15.0" xmlns="http://schemas.microsoft.com/developer/msbuild/2003">
  <ItemGroup Label="ProjectConfigurations">
    <ProjectConfiguration Include="Debug|Win32">
      <Configuration>Debug</Configuration>
      <Platform>Win32</Platform>
    </ProjectConfiguration>
    <ProjectConfiguration Include="Release|Win32">
      <Configuration>Release</Configuration>
      <Platform>Win32</Platform>
    </ProjectConfiguration>
    <ProjectConfiguration Include="Debug|x64">
      <Configuration>Debug</Configuration>
      <Platform>x64</Platform>
    </ProjectConfiguration>
    <ProjectConfiguration Include="Release|x64">
      <Configuration>Release</Configuration>
      <Platform>x64</Platform>
    </ProjectConfiguration>
  </ItemGroup>
  <PropertyGroup Label="Globals">
    <VCProjectVersion>15.0</VCProjectVersion>
    <ProjectGuid>{B5C71B6E-9DA8-4217-8F96-26FFFB9FBD74}</ProjectGuid>
    <Keyword>Win32Proj</Keyword>
    <RootNamespace>ExampleMillimiman</RootNamespace>
    <WindowsTargetPlatformVersion>10.0.17763.0</WindowsTargetPlatformVersion>
    <ProjectName>ISP_AnS_10M_Main_Run</ProjectName>
  </PropertyGroup>
  <Import Project="$(VCTargetsPath)\Microsoft.Cpp.Default.props" />
  <PropertyGroup Condition="'$(Configuration)|$(Platform)'=='Debug|Win32'" Label="Configuration">
    <ConfigurationType>Application</ConfigurationType>
    <UseDebugLibraries>true</UseDebugLibraries>
    <PlatformToolset>v141</PlatformToolset>
    <CharacterSet>Unicode</CharacterSet>
  </PropertyGroup>
  <PropertyGroup Condition="'$(Configuration)|$(Platform)'=='Release|Win32'" Label="Configuration">
    <ConfigurationType>Application</ConfigurationType>
    <UseDebugLibraries>false</UseDebugLibraries>
    <PlatformToolset>v141</PlatformToolset>
    <WholeProgramOptimization>true</WholeProgramOptimization>
    <CharacterSet>Unicode</CharacterSet>
  </PropertyGroup>
  <PropertyGroup Condition="'$(Configuration)|$(Platform)'=='Debug|x64'" Label="Configuration">
    <ConfigurationType>Application</ConfigurationType>
    <UseDebugLibraries>true</UseDebugLibraries>
    <PlatformToolset>v141</PlatformToolset>
    <CharacterSet>Unicode</CharacterSet>
  </PropertyGroup>
  <PropertyGroup Condition="'$(Configuration)|$(Platform)'=='Release|x64'" Label="Configuration">
    <ConfigurationType>Application</ConfigurationType>
    <UseDebugLibraries>false</UseDebugLibraries>
    <PlatformToolset>v141</PlatformToolset>
    <WholeProgramOptimization>true</WholeProgramOptimization>
    <CharacterSet>Unicode</CharacterSet>
  </PropertyGroup>
  <Import Project="$(VCTargetsPath)\Microsoft.Cpp.props" />
  <ImportGroup Label="ExtensionSettings">
  </ImportGroup>
  <ImportGroup Label="Shared">
  </ImportGroup>
  <ImportGroup Label="PropertySheets" Condition="'$(Configuration)|$(Platform)'=='Debug|Win32'">
    <Import Project="$(UserRootDir)\Microsoft.Cpp.$(Platform).user.props" Condition="exists('$(UserRootDir)\Microsoft.Cpp.$(Platform).user.props')" Label="LocalAppDataPlatform" />
  </ImportGroup>
  <ImportGroup Label="PropertySheets" Condition="'$(Configuration)|$(Platform)'=='Release|Win32'">
    <Import Project="$(UserRootDir)\Microsoft.Cpp.$(Platform).user.props" Condition="exists('$(UserRootDir)\Microsoft.Cpp.$(Platform).user.props')" Label="LocalAppDataPlatform" />
  </ImportGroup>
  <ImportGroup Label="PropertySheets" Condition="'$(Configuration)|$(Platform)'=='Debug|x64'">
    <Import Project="$(UserRootDir)\Microsoft.Cpp.$(Platform).user.props" Condition="exists('$(UserRootDir)\Microsoft.Cpp.$(Platform).user.props')" Label="LocalAppDataPlatform" />
  </ImportGroup>
  <ImportGroup Label="PropertySheets" Condition="'$(Configuration)|$(Platform)'=='Release|x64'">
    <Import Project="$(UserRootDir)\Microsoft.Cpp.$(Platform).user.props" Condition="exists('$(UserRootDir)\Microsoft.Cpp.$(Platform).user.props')" Label="LocalAppDataPlatform" />
  </ImportGroup>
  <PropertyGroup Label="UserMacros" />
  <PropertyGroup Condition="'$(Configuration)|$(Platform)'=='Debug|Win32'">
    <LinkIncremental>true</LinkIncremental>
  </PropertyGroup>
  <PropertyGroup Condition="'$(Configuration)|$(Platform)'=='Debug|x64'">
    <LinkIncremental>true</LinkIncremental>
  </PropertyGroup>
  <PropertyGroup Condition="'$(Configuration)|$(Platform)'=='Release|Win32'">
    <LinkIncremental>false</LinkIncremental>
  </PropertyGroup>
  <PropertyGroup Condition="'$(Configuration)|$(Platform)'=='Release|x64'">
    <LinkIncremental>false</LinkIncremental>
  </PropertyGroup>
  <ItemDefinitionGroup Condition="'$(Configuration)|$(Platform)'=='Debug|Win32'">
    <ClCompile>
      <PrecompiledHeader>NotUsing</PrecompiledHeader>
      <WarningLevel>Level3</WarningLevel>
      <Optimization>Disabled</Optimization>
      <SDLCheck>true</SDLCheck>
      <PreprocessorDefinitions>WIN32;_DEBUG;_CONSOLE;%(PreprocessorDefinitions)</PreprocessorDefinitions>
      <ConformanceMode>true</ConformanceMode>
      <PrecompiledHeaderFile>pch.h</PrecompiledHeaderFile>
    </ClCompile>
    <Link>
      <SubSystem>Console</SubSystem>
      <GenerateDebugInformation>true</GenerateDebugInformation>
      <AdditionalDependencies>%(AdditionalDependencies)</AdditionalDependencies>
      <AdditionalLibraryDirectories>%(AdditionalLibraryDirectories)</AdditionalLibraryDirectories>
    </Link>
  </ItemDefinitionGroup>
  <ItemDefinitionGroup Condition="'$(Configuration)|$(Platform)'=='Debug|x64'">
    <ClCompile>
      <PrecompiledHeader>NotUsing</PrecompiledHeader>
      <WarningLevel>Level3</WarningLevel>
      <Optimization>Disabled</Optimization>
      <FunctionLevelLinking>true</FunctionLevelLinking>
      <IntrinsicFunctions>true</IntrinsicFunctions>
      <SDLCheck>true</SDLCheck>
      <PreprocessorDefinitions>_DEBUG;_CONSOLE;%(PreprocessorDefinitions)</PreprocessorDefinitions>
      <ConformanceMode>true</ConformanceMode>
      <PrecompiledHeaderFile>pch.h</PrecompiledHeaderFile>
      <LanguageStandard>stdcpp17</LanguageStandard>
      <AdditionalIncludeDirectories>..\code\boost</AdditionalIncludeDirectories>        
    </ClCompile>
    <Link>
      <SubSystem>Console</SubSystem>
      <GenerateDebugInformation>true</GenerateDebugInformation>
      <AdditionalLibraryDirectories>Libs\Debug\x64\;%(AdditionalLibraryDirectories)</AdditionalLibraryDirectories>
    </Link>
    <Lib>
      <AdditionalDependencies>Libs\Debug\x64\libboost_date_time-vc141-mt-gd-x64-1_70.lib</AdditionalDependencies>
    </Lib>
  </ItemDefinitionGroup>
  <ItemDefinitionGroup Condition="'$(Configuration)|$(Platform)'=='Release|Win32'">
    <ClCompile>
      <PrecompiledHeader>NotUsing</PrecompiledHeader>
      <WarningLevel>Level3</WarningLevel>
      <Optimization>MaxSpeed</Optimization>
      <FunctionLevelLinking>true</FunctionLevelLinking>
      <IntrinsicFunctions>true</IntrinsicFunctions>
      <SDLCheck>true</SDLCheck>
      <PreprocessorDefinitions>WIN32;NDEBUG;_CONSOLE;%(PreprocessorDefinitions)</PreprocessorDefinitions>
      <ConformanceMode>true</ConformanceMode>
      <PrecompiledHeaderFile>pch.h</PrecompiledHeaderFile>
      <MultiProcessorCompilation>true</MultiProcessorCompilation>
    </ClCompile>
    <Link>
      <SubSystem>Console</SubSystem>
      <EnableCOMDATFolding>true</EnableCOMDATFolding>
      <OptimizeReferences>true</OptimizeReferences>
      <GenerateDebugInformation>true</GenerateDebugInformation>
      <AdditionalDependencies>%(AdditionalDependencies)</AdditionalDependencies>
      <AdditionalLibraryDirectories>%(AdditionalLibraryDirectories)</AdditionalLibraryDirectories>
    </Link>
  </ItemDefinitionGroup>
  <ItemDefinitionGroup Condition="'$(Configuration)|$(Platform)'=='Release|x64'">
    
  </ItemGroup>
  <ItemGroup Label="Include">
    
  </ItemGroup>
  <Import Project="$(VCTargetsPath)\Microsoft.Cpp.targets" />
  <ImportGroup Label="ExtensionTargets">
  </ImportGroup>
</Project>

how to insert picture using this forum?

$
0
0

When I use this website​​​​​​ https://imgbb.com/  as a link, it failed to upload the picture. So how to upload a picture? Thanks.

confusing result!!!

$
0
0

Look at the attached picture. Above the picture is the source code,below is the assembly code. In the source code, N is equal to  1<<30. I have the following questions.

1. The code fragment in the red box needs to execute 1<<30 times(0x40000000), why the number of retired instructions is equal to 4,302,000,000,far less than 1<<30.(the number of sample after value of CPU_CLK_UNHALTED.THREAD event and INST_RETIRED.ANY event are both 2000000)

2. Why is different between the number of retired instructions for the two assembly instructions "inc" in the red box?

3. Is the clockticks*cpu frequency equal to the elapsed time of the code snippet?

Thank you for your reply. :)

AttachmentSize
Downloadimage/pngpicture.png242.51 KB

how to get the elapsed time of every function?

$
0
0

how to get the elapsed time of every function? not CPU time.

how to optimize this strange uPipe graph!!!!!!!!

$
0
0

My program's uPipe is like the attachment. So how to optimize this program??? Thank you.

AttachmentSize
Downloadimage/pngpicture.png115.86 KB

src_kmp_barrier_cpp functiion can't find call stack information

$
0
0

Look at attachment picture. The src_kmp_barrier_cpp functiion can't find call stack information. How to solve this problem?

AttachmentSize
Downloadimage/pngpicture.png152.11 KB

vtune faced a serious problem when compare two collections

how to just watch assembly of source code instead of run someone collection first?

$
0
0

how to just watch assembly of source code instead of run someone collection first?

Viewing all 1574 articles
Browse latest View live


<script src="https://jsc.adskeeper.com/r/s/rssing.com.1596347.js" async> </script>